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(54) DNA Diagnostics based on mass spectrometry 



(57) Fast and highly accurate mass spectrometry- 
based processes for detecting a particular nucleic acid 
sequence In a biological sample are provided. Depend- 
ing on the sequence to be detected, the processed can 
be used, for example, to diagnose a genetic disease or 
chromosomal abnormality; a predisposition to a disease 
or condition, infection by a pathogenic organism, or for 
determining Identity or heredity. 

A method and apparatus for creating multiple 
branch weRs from a parent well is disclosed. A multiple 
branching sub is provided for placement at a branching 
node of a well. Such sub includes a branching chamber 
and a plurality of branching outlet members. The outlet 
members during construction of the branching sub, 



have previously been distorted into oblong shapes so 
that all of the branching outlet members fit within an im- 
aginary cylinder which is coaxial with and substantially 
the same radius as the branching chamber. After de- 
ployment of the branching sub via a parent casing in the 
well, a forming tool is lowered to the Interior of the sub. 
The outlet members are extended outwardly by the 
forming tool and simultaneously formed into substantial- 
ly round tubes. Next, each outlet member is plugged 
with cement, after which each branch well is drilled 
through a respective outlet member. If desired, each 
branch may be lined with casing and sealed to a branch- 
ing outlet by means of a casing hanger. A manifold 
placed in the branching chamber controls the production 
of each branch well to the parent well. 
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Description 
Related Applications 

[0001] For U.S. National Stage purposes, this application is a continuation-in-part of U.S. application Serial No. 
08/744,481 , filed November 6, 1 996, to Kdster, entitled "DNA DIAGNOSTICS BASED ON MASS SPECTROMETRY". 
This application is also a continuation-in-part of U.S. application Serial Nos. 08/744,590, 08/746,036, 08/746,055, 
08/786,988, 08/787,639, 08/933,792 and U.S. application Serial No. atty dkt. no. 7352-2001B, filed October B, 1997, 
which is a continuation-in-part of U.S. application Nos. 08/746,055, 08/786,988 and 08/787,639. For international pur- 
poses, benefit of priority is claimed to each of these applications. 

[0002] This application is related to U.S. Patent Application Serial No. 08/61 7,256 filed on March 18, 1996, which is 
a continuation-in-part of U.S. application Serial No. 08/406,199, filed March 17, 1995, now U.S. Patent No. 5,605,798, 
and is also related U.S. Patent Nos. 5,547,835 and 5,622,824. 

[0003) Where permitted the subject matter of each of the above-noted patent applications and the patent is herein 
incorporated in its entirety. 

BACKGROUND OF THE INVENTION 

Detection of mutations 

[0004] The genetic information of atl living organisms (e.g. , animals, plants and microorganisms) is encoded in de- 
oxyribonucleic acid (DNA). In humans, the complete genome is contains of about 100,000 genes located on 24 chro- 
mosomes (The Human Genome, T. Strachan, BIOS Scientific Publishers, 1 992). Each gene codes for a specific protein, 
which after its expression via transcription and translation, fulfills a specific biochemical function within a living cell. 
Changes in a DNA sequence are known as mutations and can result in proteins with altered or in some cases even 
lost biochemical activities; this In turn can cause genetic disease. Mutations include nucleotide deletions, Insertions or 
alterations (i.e. point mutations). Point mutations can be either "missense", resulting in a change in the amino acid 
sequence of a protein or "nonsense" coding for a stop codon and thereby leading to a truncated protein. 
[0005] More than 3000 genetic diseases are currently known (Human Genome Mutations, D. N. Cooper and M. 
Krawczak, BIOS Publishers, 1993), including hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD), Hunt- 
ington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF). In addition to mutated genes, which result in 
genetic disease, certain birth defects are the result of chromosomal abnormalities such as Trisomy 21 (Down's Syn- 
drome), Trisomy 1 3 (Patau Syndrome), Trisomy 1 8 (Edward's Syndrome), Monosomy X (Turner's Syndrome) and other 
sex chromosome aneuploidies such as Klienfelter's Syndrome (XXY). Further, there is growing evidence that certain 
DNA sequences may predispose an individual to any of a number of diseases such as diabetes, arteriosclerosis, 
obesity, various autoimmune diseases and cancer (e.g. , colorectal, breast, ovarian, lung). 

[0006] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are 
different from the sequences contained in the host cell. Therefore, infectious organisms can also be detected and 
identified based on their specific DNA sequences. 

[0007] Since the sequence of about 1 6 nucleotides is specific on statistical grounds even for the size of the human 
genome, relatively short nucleic acid sequences can be used to detect normal and defective genes in higher organisms 
and to detect infectious microorganisms ( e.g. , bacteria, fungi, protists and yeast) and viruses. DNA sequences can 
even serve as a fingerprint for detection of different individuals within the same species (see, Thompson, J. S. and M. 
W. Thompson, eds., Genetics in Medicine , W.B. Saunders Co., Philadelphia, PA (1991)). 

[0008] Several methods for detecting DNA are currently being used. For example, nucleic acid sequences can be 
identified by comparing the mobility of an amplified nucleic acid fragment with a known standard by gel electrophoresis, 
or by hybridization with a probe, which is complementary to the sequence to be Identified. Identification, however, can 
only be accomplished if the nucleic acid fragment is labeled with a sensitive reporter function (e.g. , radioactive 
I 32 ?, ^S), fluorescent orchemlluminescent). Radioactive labels can be hazardous and the signals they produce decay 
over time. Non-lsotopic labels (e.g. , fluorescent) suffer from a lack of sensitivity and fading of the signal when high 
intensity lasers are being used. Additionally, performing labeling, electrophoresis and subsequent detection are labo- 
rious, time-consuming and error-prone procedures. Electrophoresis is particularly error-prone, since the size or the 
molecular weight of the nucleic acid cannot be directly correlated to the mobility in the gel matrix. It is known that 
sequence specific effects, secondary structure and interactions with the gel matrix are causing artefacts. 

Use of mass spectrometry for detection and Identification of nucleic acids 

[0009] Mass spectrometry provides a means of "weighing" individual molecules by ionizing the molecules in vaccuo 
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and making them "fly* by volatilization. Under the influence of combinations of electric and magnetic fields, the ions 
follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low molecular 
weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and characteri- 
zation of organic molecules by the determination of the mass of the parent molecular ion. In addition, by arranging 
collisions of this parent molecular Ion with other particles ( e.g. , argon atoms), the molecular ion Is fragmented forming 
secondary ions by the so-called collision induced dissociation (CID). The fragmentation pattern/pathway very often 
allows the derivation of detailed structural Information. Many applications of mass spectrometric methods are known 
in the art, particularly In biosciences (see, ejj.. Methods in Enzymol. , Vol. 1 93: "Mass Spectrometry" (J. A. McCloskey, 
editor), 1990, Academic Press, New York). 

[0010] Because of the apparent analytical advantages of mass spectrometry in providing high detection sensitivity, 
accuracy of mass measurements, detailed structural Information by CID in conjunction with an MS/MS configuration 
and speed, as well as on-line data transfer to a computer, there has been interest in the use of mass spectrometry for 
the structural analysis of nucleic acids. Recent reviews summarizing this field Include K.H. Schram, "Mass Spectrometry 
of Nucleic Acid Components, Biomedical Applications of Mass Spectrometry" 34, 203-287 (1 990); and P.R Crain, "Mass 
Spectrometric Techniques In Nucleic Acid Research," Mass Spectrometry Reviews 9. 505-554 (1990); see, also U.S. 
Patent No. 5,547,835 and U.S. Patent No. 5,622,824). 

[0011] Nucleic acids, however, are very polar biopolymers that are very difficult to volatilize. Consequently, mass 
spectrometric detection has been limited to low molecular weight synthetic oligonucleotides for confirming an already 
known oligonucleotide sequence by determining the mass of the parent molecular ion, or alternatively, confirming a 
known sequence through the generation of secondary ions (fragment ions) via CID in an MS/MS configuration using, 
in particular, lor the ionization and volatilization, the method of fast atomic bombardment (FAB mass spectrometry) or 
plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis of protected dimeric 
blocks for chemical synthesis of oligodeoxynucleotides has been described (Koster et al (1987) Biomed. Environ. 
Mass Spectrometry 14 , 111-116). 

[001 2] Other ionizatlon/desorption techniques include electros pray/ton -spray (ES) and matrix-assisted laser desorp- 
tion/lonization (MALDl). ES mass spectrometry has been introduced by Fenn et al. (J. Phys. Chem. 88 :4451 -59 (1 984); 
PCT Application No. WO 90/14148) and current applications are summarized in review articles (see, e.g. , Smith et al 
(1990) Anal. Chem . 62:882-89 and Ardrey (1992) Electrospray Mass Spectrometry, Spectroscopy Europe 4:10-18). 
The molecular weights of a tetradecanucleotide (see, Covey et aL (1 98B) The "Determination of Protein, Oligonucleotide 
and Peptide Molecular Weights by lonspray Mass Spectrometry," Rapid Commun. in Mass Spectrometry 2:249-256), 
and of a 21-mer (Methods in Enzymol. , 193 , "Mass Spectrometry* (McCloskey, editor), p. 425, 1 990, Academic Press, 
New York) have been published. As a mass analyzer, a quadrupole is most frequently used. Because of the presence 
of multiple ion peaks that all could be used for the mass calculation, the determination of molecular weights in femtomole 
amounts of sample is very accurate. 

[0013] MALDl mass spectrometry, in contrast, can be attractive when a time-of-flfght (TOF) configuration (see, 
Hillenkamp et al. (1 990) pp 49-60 in "Matrix Assisted UV-Laser Desorption/Ionization: A New Approach to Mass Spec- 
trometry of Large Biomolecules," Biological Mass Spectrometry , Burtingame and McCloskey, editors, Elsevier Science 
Publishers, Amsterdam) is used as a mass analyzer. Since, in most cases, no multiple molecular ion peaks are produced 
with this technique, the mass spectra, in principle, look simpler compared to ES mass spectrometry. 
[0014] Although DNA molecules up to a molecular weight of 410,000 daltons have been desorbed and volatilized 
(Williams et aJ. f "Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation of Frozen Aqueous Solutions," 
Science 246, 1585-87 (1 9B9)), this technique had only shown very low resolution (oligothymidylic acids up to 18 nucle- 
otides, Huth-Fehre et aj. Rapid Commun. in Mass Spectrom. . 6, 209-13 (1 992); DNA fragments up to 500 nucleotides 
in length K. Tang ef a/., Rapid Commun. in Mass Spectrom. , 8, 727-730 (1 994); and a double-stranded DNA of 28 base 
pairs (Williams et at, "Tlme-of-Flight Mass Spectrometry of Nucleic Acids by Laser Ablation and Ionization from a Frozen 
Aqueous Matrix," Rapid Commun. in Mass Spectrom. , 4, 348-351 (1990)). Japanese Patent No. 59-131909 describes 
an instrument, which detects nucleic acid fragments separated either by electrophoresis, liquid chromatography or high 
speed gel filtration. Mass spectrometric detection is achieved by incorporating into the nucleic acids, atoms, such as S, 
Br, I or Ag, Au, Pt, Os, Hg, that normally do not occur in DNA. 

[0015] Co-owned U.S. Patent No. 5,622,824 describes methods for DNA sequencing based on mass spectrometric 
detection. To achieve this, the DNA is by means of protection, specificity of enzymatic activity, or immobilization, uni- 
laterally degraded in a stepwise manner via exonuclease digestion and the nucleotides or derivatives detected by mass 
spectrometry. Prior to the enzymatic degradation, sets of ordered deletions that span a cloned DNA fragment can be 
created. In this manner, mass-modified nucleotides can be incorporated using a combination of exonuclease and DNA/ 
RNA polymerase. This permits either multiplex mass spectrometric detection, or modulation of the activity of the exo- 
nuclease so as to synchronize the degradative process. Co-owned U.S. Patent Nos. 5,605,798 and 5,547,835 provide 
methods for detecting a particular nucleic acid sequence in a biological sample. Depending on the sequence to be 
detected, the processes can be used, for example, In methods of diagnosis. These methods, while broadly useful and 
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applicable to numerous embodiments, represent the first disclosure of such applications and can be improved upon. 
[001 6} Therefore, it is an object herein to provided improved methods for sequencing and detecting DNA molecules 
in biological samples. It is also an object herein to provided improved methods for diagnosis of genetic diseases, 
predispositions to cetain diseases, cancers, and infections. 

SUMMARY OF THE INVENTION 

[0017) Methods of diagnosis by detecting and/or deteiming sequences of nucleic acids that are based on mass 
spectrometry are provided herein. Methods are provided for detecting double-stranded DNA, detecting mutations and 
other diagnostic markers using MS analysis. In particular, methods for diagnosing neuroblastoma, detecting heredity 
relationships, HLA compatibility, genetic fingerprinting, detecting teleromase activity for cancer diagnosis are provided. 
[0018] In certain embodiments the DNA Is immobilized on a solid support either directly or via a linker and/or bead. 
Three permutions of the methods for DNA detection in which immobilized DNA is used are exemplified. These include: 
(1) immobilization of a template; hybridization of the primer; extension of the primer, or extension of the primer (single 
ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease degradation (sequencing); (2) 
immobilization of a primer; hybridization of a single stranded template; and extension of the primer, or extension of the 
primer (single ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease degradation (se- 
quencing); (3) immobilization of the primer; hybridization of a double stranded template; extension of the primer, or 
extension of the primer (single ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease 
degradation (sequencing). 

[001 9] In certain embodiments the DNA is immobilized on the support via a selectively cleavable linker. Selectively 
cleavable linkers indude, buta are not limited to photocleavable linkers, chemically cleavable linkers and an enzymat- 
icalry (such as a restriction site (nucleic acid linker), a protease site) cleavable lin kers. Inclusion of a selectively cleavable 
linker expands the capabilities of the MALDI-TOF MS analysis because it allows for all of the permutations of immo- 
bilization of DNA for MALDI-TOF MS, the DNA linkage to the support through the 3'- or 5'-end of a nucleic acid; allows 
the amplified DNA or the target primer to be extended by DNA synthesis; and further allows for the mass of the extended 
product (or degraded product via exonuclease degradation) to be of a size that is appropriate for MALDI-TOF MS 
analysis (i.e., the isolated or synthesized DNA can be large and a small primer or a large primer sequence can be used 
and a small restriction fragment of a gene or single strand thereof hybridized thereto). 

[0020] In a preferred embodiment, the selectively cleavable linker is a chemical or photocleavable linker that is 
cleaved during the ionizing step of mass spectrometry. Exemplary linkers include linkers containing, a disulfide group, 
a leuvinyi group, an acid-labile trityl group and a hydrophobic trityl group. In other embodiments, the enzymaticalfy 
cleavable tinker can be a nucleic acid that is an RNA nucleotide or that encodes a restriction endonuclease site. Other 
enzymaticaJly cleavable linkers include linkers that contain a pyrophosphate group, an arginine-arginine group and a 
lysine-lysine group. Other linkers are exemplified herein. 

[0021] Methods for sequencing long fragments of DNA are provided. To perform such sequencing, specific base 
terminated fragments are generated from a target nucleic acid. The analysis of fragments rather than the full length 
nucleic acid shifts the mass of the ions to be determined into a lower mass range, which is generally more amenable 
to mass spectometric detection. For example, the shift to smaller masses Increases mass resolution, mass accuracy 
and, in particular, the sensitivity for detection. Hybridization events and the actual molecular weights of the fragments 
as determined by mass spectrometry provide sequence information (ejj.. the presence anoVor identity of a mutation). 
In a preferred embodiment, the fragments are captured on a solid support prior to hybridization and/or mass spectrom- 
etry detection, in another preferred embodiment, the fragments generated are ordered to provide the sequence of the 
larger nucleic acid. 

[0022] One preferred method for generating base specifically terminated fragments from a nucleic acid Is effected 
by contacting an appropriate amount of a target nucleic acid with an appropriate amount of a specific endonuclease, 
thereby resulting in partial or complete digestion of the target nucleic acid. Endonucleases will typically degrade a 
sequence into pieces of no more than about 50-70 nucleotides, even if the reaction is not run to full completion. In a 
preferred embodiment, the nucleic acid is a ribonucleic acid and the endonuclease is a ribonuclease (RNase) selected 
from among: the G-speclflc RNase T 1t the A-specrfic RNase U 2 , the A/U specific RNase PhyM, U/C specific RNase 
A, C specific chicken liver RNase (RNase CL3) or crisavitln. In another preferred embodiment, the endonuclease is a 
restriction enzyme that cleaves at least one site contained within the target nucleic acid. Another preferred method for 
generating base specifically terminated fragments includes performing a combined amplification and base-specific 
termination reaction (e^., using an appropriate amount of a first DNA polymerase, which has a relatively low affinity 
towards the chain-terminating nucleotides resulting in an exponential amplification of the target; and a polymerase with 
a relatively high affinity for the chain terminating nucleotide resulting in base-specific termination of the polymerization. 
Inclusion of a tag at the 5' and/or 3* end of a target nucleic acid can facilitates the ordering of fragments. 
[0023] Methods for determining the sequence of an unknown nucleic acid in wnfch the 5' and/or 3* end of the target 
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nucleic acid can include a tag are provided. Inclusion of a non-natural tag on the 3* end is also useful for ruling out or 
compensating for the influence of 3' heterogeneity, premature termination and nonspecific elongation. In a preferred 
embodiment, the tag Is an affinity tag (e.g. , blotin or a nucleic acid that hybridizes to a capture nucleic acid). Most 
preferably the affinity tag facilitates binding of the nucleic acid to a solid support. In another preferred embodiment, the 
tag is a mass marker (i.e. a marker of a mass that does not correspond to the mass of any of the four nucleotides). In 
a further embodiment, the tag is a natural tag, such as a polyA tall or the natural 3' heterogeneity that can result, for 
example, from a transcription reaction. 

[00241 Methods of sequence analysis in which nucleic acids have been replicated from a nucleic acid molecule 
obtained from a biological sample are specifically digested using one or more nucleases (deoxyribonucleases for DNA, 
and ribonucleases for RNA) are provided. The fragments captured on a solid support carrying the corresponding com- 
plementary sequences. Hybridization events and the actual molecular weights of the captured target sequences provide 
information on mutations in the gene. The array can be anafyzed spot-by-spot using mass spectrometry. Further, the 
fragments generated can be ordered to provide the sequence of the larger target fragment. 

[0025] In another embodiment, at least one primer with a 3'-terminal base is hybridized to the target nucleic acid 
near a site where possible mutations are to be detected. An appropriate polymerase and a set of three nucleoside 
triphosphates (NTPs) and the fourth added as a terminator are reacted. The extension reaction products are measured 
by mass spectrometry and are indicative of the presence and the nature of a mutation. The set of three NTPs and one 
dd-NTP (or three NTPs and one 3'-deoxy NTP), will be varied to be able to discriminate between several mutations 
(including compound heterozygotes) in the target nucleic acid sequence. 

[0026] Methods for detecting and diagnosing neoplasia/malagnancies in a tissue or cell sample are provided. The 
methods rely on a teiomeric repeat amplification protocol (TRAP) -MS assay and include the steps of: 

a) obtaining a tissue or a cell sample, such as a clinical isolate or culture of suspected cells; 

b) isolating/extracting/purifying tefomerase from the sample; 

c) adding the telomerase extract to a composition containing a synthetic DNA primer, which is optionally immobi- 
lized, complementary to the teiomeric repeat, and all four dNTPs under conditions that result in telomerase specific 
extension of the synthetic DNA; 

d) amplifying the telomerase extended DNA products,, preferably using a primer that contains a 'linker moiety", 
such as a moiety based on thiol chemistry or streptavidin; 

e) isolating linker-amplified primers, such as by using a complementary binding partner immobilized on a solid 
support; 

f) optionally conditioning the DNA for crystal formation; and 

g) performing MS by fonfzing/Volatizing the sample to detect the DNA product. 

Telomerase-specif ic extension Is indicative of neoplalsa/malignancy. 

This method can be used to detect ect specific malignancies. The use of MS to detect the DNA product permits iden- 
tification the extended product, which is indicative of telomerase activity in the sample. If desired, the synthetic DNA 
can be in the form an array. 

[0027] Methods for detecting mutations are provided and the use thereof oncogenes and to thereby screen for trans- 
formed cells, which are indicative of neoplasia. Detection of mutations present in oncogenes are indicative of trans- 
formation. This method includes the steps of: 

a) obtaining a biological sample; 

b) amplifying a portion of the selected proto-oncogene that includes a codon Indicative of transformation, where 
one primer has a linker moiety for immobilization; 

c) immobilizing DNA via the linker moiety to a solid support, optionally In the form of an array; 

d) hybridizing a primer complementary to the proto oncogene sequence that Is upstream from the codon 

e) adding 3dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP location; 
0 ionizing/volatizing the sample; and 

g) detecting the mass of the extended DNA, whereby mass indicates the presence of wild-type or mutant alleles. 
The presence of a mutant allele at the codon is diagnostic for neoplasia. 

In an exemplary embodiment, extension-MS analysis Is used detect the presence of a mutated codon 634 in the ret- 
rovirus (RET)-proto oncogene. 

[0028] In another embodiment, methods for diagnosing diseases using reverse transcription and amplification of a 
gene expressed in transformed cells. In particular, a method for diagnosis of neuroblastoma using reverse transcriptase 
(RT> MS of tyrosine hydroxylase, which is a catecholamine blosynthetic enzyme that expressed in tumor cells, but not 
in normal cells, such as normal bone marrow cells is provided. The method includes the steps of: 
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a) obtaining a tissue sample; 

b) isolating pofyA RNA from the sample; 

c) preparing a cDNA library using reverse transcription; 

d) amplifing a cDNA product, or portion thereof, of the selected gene, where one oligo primer has a linker moiety; 

e) isolating the amplified product by Immobilizing the DNA to solid support via the linker moiety; 

f) optionally conditioning the DNA: 

g) ionizlng/volatizing sample and detecting the presence of a DNA peak that is Indicative of expression of the 
selected gene gene. For example, expression of the tyrosine hydroxylase gene is indicative of neuroblastoma. 

[0029] Also provided are methods of directly detecting a double-stranded nucleic acid using MALDI-TOF MS. These 
methods include the steps of: 

a) isolating a double stranded DNA of an appropriate size for MS via amplification methods or formed by hybridi- 
zation of single-stranded DNA fragment; 

b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA in 
which the conditions include one or all of the following: preparing samples for analysis at reduced temperatures 
(i.e. 4 ° C), and using of higher DNA concentrations in the matrix to drive duplex formation 

c) ionizing/volatizing the sample of step b), where this step uses low acceleration voltage of the ions to assist in 
maintaining duplex DNA by, for example, adjusting laser power to just above threshold irradiation for ionization, and 

d) detecting the presence of the dsDNA of the appropriate mass. 

In preferred embodiments, the matrix includes 3-hydroxyplcoIinic acid. The detected DNA can be indicative of a genetic 
disorder, genetic disease, genetic predisposition to a disease chromosomal abnormalities. In other embodiments, the 
mass of the double stranded DNA is indicative of the deletion, insertion, mutation. 

[0030] A method designated primer oligo base extension (PROBE) is provided. This method uses a single detection 
primer followed by an oligonucleotide extension step to give products, which can be readily reserved by MALDI-TOF 
mass spectrometry. The products differ in length by a number of bases specific for a number of repeat units or for 
second site mutations within the repeated region. The method is exemplified using as a model system the AluVpA 
polymorphism in intron 5 of the interferon-a receptor gene located on human chromosome 21 , and the poly T tract of 
the splice acceptor site of intron 8 from the CFTR gene located on human chromosome 7. The method is advanta- 
geously used for example, for determining identity, identifying mutations, familial relationship, HLA compatabilrty and 
other such markers .using PROBE-MS analysis of microsatellite DNA. In a preferred embodiment, the method includes 
the steps of: 

a) obtaining a biological sample from two individuals; 

b) amplifying a region of DNA from each individual that contains two or more microsatellite DNA repeat sequences 

c) ionizing/voJatizing the amplified DNA; 

d) detecting the presence of the amplified DNA and comparing the molecular weight of the amplified DNA. Different 
sizes are indicative of non-identity (he^ wild-type versus mutation), non-heredity or non-compatibility; similar size 
fragments indicate the possibility identity, of familial relationship, or HLA compatibility. 

[0031] More than one marker may be examined simulataneouisy, primers with different linker moieties are used for 
immobilization. 

[0032] Another method loop-primer oligo base extension, designated LOOP-PROBE, for detection of mutations es- 
pecially predominant disease causing mutations or common polymorphisms is provided. In a particular embodiment, 
this method for detecting target nucleic acid in a sample, includes the steps of: 

a) amplifying a target nucleic acid sequence, such as p-globin, in a sample, using (I) a first primer whose 5'-end 
shares identity to a portion of the target DNA immediately downstream from the targeted codon followed by a 
sequence that introduces a unique restriction endonuclease site, such as Cfol In the case of 0-globin, into the 
ampltaon and whose 3' -end primer Is self-complementary; and (II) a second downstream primer that contains a 
tag, such as biotin, for immobilizing the DNA to a solid support, such as streptavidin beads; 

c) immobilizing the double-stranded amplified DNA to a solid support via the linker moiety; 

d) denaturing the immobilized DNA and Isolating the non-Immobilized DNA strand; 

e) annealing the intracomplementary sequences in the 3*-end of the isolated non-immobilzed DNA strand, such 
that the 3'-end is extendable by a polymerase, which annealing can be performed, for example, by heating then 
and cooling to about 37* C, or other suitable method; 

0 extending the annealed DNA by adding DNA polymerase, 3 dNTPs/t ddNTP, whereby the 3'-end of the DNA 
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strand is extended by the DNA polymerase to the position of the nextddNTP location (re., to the mutation location); 
g) cleaving the extended double stranded stem loop DNA with the unique restriction endonuclease and removing 
the cleaved stem loop DNA 

i) (optionally adding a matrix) ionizing/volatizing the extended product; and 

]) detecting the presence of the extended target nucleic acid, whereby the presence of a DNA fragment of a mass 
different from wild-type Is Indicative of a mutation at the target codon(s). 

This method eliminates one specific reagent for mutation detection compared other methods of MS mutational analyses, 
thereby simplifying the process and rendering it amenable to automation. Also, the specific extended product that is 
analyzed is cleaved from the primer and is therefore shorter compared to the other methods. In addition, the annealing 
efficiency is higher compared to annealing of an added primer and should therefore generate more product. The process 
Is compatible with multiplexing and various detection schemes ( e.g. , single base extension, ollgo base extension and 
sequencing). For example, the extension of the loop-primer can be used for generation of short diagnostic sequencing 
ladders within highly polymorphic regions to perform, for example, HLA typing or resistance as well as species typing. 
[0033] In another emodiment, a methods of detecting a target nucleic acid in a biological sample using RNA ampli- 
fication is provided. In the method, the target is amplified the target nucleic acid, using a primer that shares a region 
complementary to the target sequence and upstream encodes a promoter, such as the T7 promoter A DNA-dependent 
RNA polymerase and appropriate ribonucleotides are added to synthesize RNA, which is analyzed by MS. 
[0034] Improved methods of sequencing DNA using MS are provided. In these methods thermocycling for amplifi- 
cation is used prior to MS analysis, thereby increasing the signal. 

[0035] Also provide are primers for use in MS analyses. In particular, primers, comprising ail or, for longer oligonu- 
cleotides, at least about 20, preferably about 16, bases of any of the sequence of nucleotides sequences set forth in 
SEQ ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 9B, 101-110, 112-123, 126, 128, 129, and primers set forth in SEQ 
ID Nos. 280-287. The primers are unlabeled, and optionally include a mass modifying moiety, which is preferably 
attached to the 5'end. 

[0036] Other features and advantages of the methods provided herein will be further described with reference to the 
following Figures, Detailed Description and Claims. 

BRIEF DESCRIPTION OF THE FIGURES 

[0037] FIGURE 1A is a diagram showing a process for performing mass spectrometric analysis on one target de- 
tection site (TDS) contained within a target nucleic acid molecule (T), which has been obtained from a biological sample. 
A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence Is chosen 
to specifically hybridize with a complementary sequence on the target nudeic acid molecule (T), known as the target 
capture site (TCS). The spacer (S) facilitates unhindered hybridization. A detector nucleic acid sequence (D), which 
is complementary to the TDS is then contacted with the TDS. Hybridization between D and the TDS can be detected 
by mass spectrometry. 

[0038] FIGU RE 1 B is a diagram showing a process for performing mass spectrometric analysis on at least one target 
detection site (here TDS 1 and TDS 2) via direct linkage to a solid support. The target sequence (T) containing the 
target detection site (TDS 1 and TDS 2) is immobilized to a solid support via the formation of a reversible or irreversible 
bond formed between an appropriate functionality (L*) on the target nucleic acid molecule (T) and an appropriate func- 
tionality (L) on the solid support. Detector nucleic acid sequences (here D1 and 02), which are complementary to a 
target detection site (TDS 1 or TDS 2) are then contacted with the TDS. Hybridization between TDS 1 and D1 and/or 
TDS 2 and D2 can be detected and distinguished based on molecular weight differences. 

[0039] FIGURE 1 C Is a diagram showing a process for detecting a wildtype (D**) and/or a mutant (D mut ) sequence 
in a target (T) nucleic acid molecule. As in Figure 1 A, a specific capture sequence (C) is attached to a solid support 
(SS) via a spacer (S). In addition, the capture sequence is chosen to specifically interact with a complementary se- 
quence on the target sequence (T). the target capture site (TCS) to be detected through hybridization. If the target 
detection site (TDS) includes a mutation, X, etection sites can be distinguished from wildtype by mass spectrometry. 
Preferably, the detector nucleic acid molecule (D) is designed so that the mutation is In the middle of the molecule and 
therefore would not lead to a stable hybrid if the wildtype detector oligonucleotide (D**) is contacted with the target 
detector sequence, ejj.. as a control. The mutation can also be detected if the mutated detector oligonucleotide (D™"*) 
with the matching base at the mutated position is used for hybridization. If a nucleic acid molecule obtained from a 
biological sample is heterozygous for the particular sequence (I.e. contain D** and D m «»), D** and D mut will be bound 
to the app and D 1 ™ 1 to be detected simultaneously. 

[0040] FIGURE 2 is a diagram showing a process in which several mutations are simultaneously detected on one 
target sequence molecular weight differences between the detector oligonucleotides D1 , D2 and D3 must be large 
enough so that simultaneous detection (multiplexing) is possible. This can be achieved either by the sequence itself 
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(composition or length) or by the introduction of mass-modifying functionalities M1 -M3 into the detector oligonucleotide. 
[00411 FIGURE 3 is a diagram showing still another multiplex detection format. In this embodiment, differentiation 
is accomplished by employing different specific capture sequences which are position-specifically immobilized on a 
flat surface (e.g. , a 'chip array 1 ). If different target sequences T1 -Tn are present, their target capture sites TCS1 -TCSn 
will interact with complementary immobilized capture sequences Cl-Cn. Detection is achieved by employing appro- 
priately mass differentiated detector oligonucleotides D1-Dn, which are mass differentiated either by their sequences 
or by mass modifying functionalities M1-Mn. 

[0042] FIGURE 4 is a diagram showing a format wherein a predesigned target capture site (TCS) is incorporated 
into the target sequence using nucleic acid (i.e. , PCR) amplification. Only one strand is captured, the other is removed 
based on the interaction between biotin and streptavidln coated magnetic beads). If the biotln is attached to 
primer 1 the other strand can be appropriately marked by a TCS. Detection is as described above through the interaction 
of a specif fc detector oligonucleotide D with the corresponding target detection site TDS via mass spectrometry. 
[0043] FIGURES is a diagram showing how amplification (here ligase chain reaction (LCR)) products can be prepared 
and detected by mass spectrometry. Mass differentiation can be achieved by the mass modifying functionalities (M1 
and M2) attached to primers (P1 and P4 respectively). Detection by mass spectrometry can be accomplished directly 
(i.e. without employing immobilization and target capturing sites (TCS)). Multiple LCR reactions can be performed in 
parallel by providing an ordered array of capturing sequences (C). This format allows separation of the ligation products 
and spot by spot identification via mass spectrometry or multiplexing if mass differentiation is sufficient. 
[0044] FIGURE 6A is a diagram showing mass spectrometric analysis of a nucleic acid molecule, which has been 
amplified by a transcription amplification procedure. An RNA sequence is captured via its TCS sequence, so that 
wildtype and mutated target detection sites can be detected as above by employing appropriate detector oligonucle- 
otides (D). 

[0045] FIGURE 6B is a diagram showing multiplexing to detect two different (mutated) sites on the same RNA in a 
simultaneous fashion using mass-modified detector oligonucleotides M1-D1 and M2-D2. 

[0046] FIGURE 6C is a diagram of a different multiplexing procedure for detection of specific mutations by employing 
mass modified dideoxynucleoside or 3'-deoxynucleoside triphosphates and an RNA dependent DNA polymerase. Al- 
ternatively, DNA dependent RNA polymerase and ribonucleotide phosphates can be empfoyed. This format allows for 
simultaneous detection of all four base possibilities at the site of a mutation (X). 

[0047] FIGURE 7A is a diagram showing a process for performing mass spectrometric analysis on one target de- 
tection site (TDS) contained within a target nucleic acid molecule (T), which has been obtained from a biological sample. 
A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen 
to specifically hybridize with a complementary sequence on T known as the target capture site (TCS). A nucleic add 
molecule that is complementary to a portion of the TDS is hybridized to the TDS 5' of the site of a mutation (X) within 
the TDS. The addition of a complete set of dideoxynucieosides or 3'-deoxy nucleoside triphosphates (e.g. , pppAdd, 
pppTdd, pppCdd and pppGdd) and a DNA dependent DNA or RNA polymerase allows for the addition only of the one 
dideoxynucleoside or 3'-deoxy nucleoside triphosphate that is complementary to X. 

[0048] FIGURE 7B is a diagram showing a process for performing mass spectrometric analysis to determine the 
presence of a mutation at a potential mutation site (M) within a nucleic acid molecule. This format allows for simulta- 
neous analysis of alleles (A) and (B) of a double stranded target nudeic acid molecule, so that a diagnosis of ho- 
mozygous normal, homozygous mutant or heterozygous can be provided. Allele A and B are each hybridized with 
complementary oligonucleotides ((C) and (D) respectively), that hybridize to A and B within a region that includes M. 
Each heteroduplex is then contacted with a single strand specific endonuclease, so that a mismatch at M f indicating 
the presence of a mutation, results in the cleavage of (C) anoVor (D), which can then be detected by mass spectrometry. 
[0049] FIGURE 8 is a diagram showing how both strands of a target DNA can be prepared for detection using tran- 
scription vectors having two different promoters at opposite locations (e.g. , the SP6 and T7 promoter). This format is 
particularly useful for detecting heterozygous target detections sites (TDS). Employing the SP6 or the T7 RNA polymer- 
ase both strands could be transcribed separately or simultaneously. The transcribed RNA molecules can be specifically 
captured and simultaneously detected using appropriately mass-differentiated detector oligonucleotides. This can be 
accomplished either directly in solution or by parallel processing of many target sequences on an ordered array of 
specifically immobilized capturing sequences. 

[0050] FIGURE 9 is a diagram showing how RNA prepared as described in Figures 6, 7 and 8 can be specifically 
digested using one or more ribonucleases and the fragments captured on a solid support carrying the corresponding * 
complementary sequences. Hybridization events and the actual molecular weights of the captured target sequences 
provide information on whether and where mutations In the gene are present The array can be analyzed spot by spot 
using mass spectrometry. DNA can be similarly digested using a cocktail of nucleases Including restriction endonucle- 
ases. Mutations can be detected by different molecular weights of specific, individual fragments compared to the mo- 
lecular weights of the wildtype fragments. 

[0051] FIGURE 10A shows UV spectra resulting from the experiment described in the following Example 1 . Panel 



8 



EP 1 164 203 A2 



I) shows the absorbance of the 26-mer before hybridization. Panel ii) shows the filtrate of the centrifugation after hy- 
bridization. Panel iii) shows the results after the first wash with 50 mM ammonium citrate. Panel iv) shows the results 
after the second wash with 50 mM ammonium citrate. 

[0052] FIGURE 10B shows a mass spectrum resulting from the experiment described In the following Example 1 
after three washing/centrifugation steps. 

[0053] FIGURE IOC shows a mass spectrum resulting from the experiment described In the following Example 1 
showing the successful desorption of the hybridized 26-mer off of beads in accordance with the format depicted sche- 
matically in Figure 1 B. 

[0054] FIGURE 11 shows a mass spectrum resulting from the experiment described in the following Example 1 
showing the giving proof of an experiment as schematically depicted in FIGURE 1 B successful desorption of the hy- 
bridized 40-mer. The efficiency of detection suggests that fragments much longer than 40-mers can also be desorbed. 
Figure 12 shows a mass spectrum resulting from the experiment described in the following Example 2 showing the 
successful desorption and differentiation of an 18-mer and 19-mer by electrospray mass spectrometry, the mixture 
(top), peaks resulting from 18-mer emphasized (middle) and peaks resulting from 19-mer emphasized (bottom) 
[0055] FIGURE 13 is a graphic representation of the process for detecting the Cystic Fibrosis mutation AF508 as 
described in Example 3. 

[0056] FIGURE 14 is a mass spectrum of the DNA extension product of a AF508 homozygous normal of Example 3. 
[0057J FIGURE 1 5 is a mass spectrum of the DNA extension product of a AF508 heterozygous mutant of Example 3. 
[0058] FIGU RE 1 6 is a mass spectrum of the DNA extension product of a AF508 homozygous normal of Example 3. 
[0059] FIGURE 1 7 is a mass spectrum of the DNA extension product of a AF508 homozygous mutant of Example 3. 
[0060] FIGURE 1 8 Is a mass spectrum of the DNA extension product of a AF508 heterozygous mutant of Example 3. 
£0061] FIGURE 19 is a graphic representation of various processes for performing apolipo protein E genotyping of 
Example 4. 

[0062] FIGURE 20 shows the nucleic acid sequence of normal apollpoprotein E (encoded by the E3 allele, FIG. 20B) 
and other isotypes encoded by the E2 and E4 alleles (FIG. 20A). 

[0063] FIGURE 21 A shows a composite restriction pattern for various genotypes of apo lipoprotein E using the Cfol 
restriction endonuclease. 

[0064] FIGURE 21 B shows the restriction pattern obtained in a 3.5% MetPhor Agarose Gel for various genotypes 
of apolipoprotein E. 

[0065] FIGURE 21 C shows the restriction pattern obtained in a 12% polyacryiamide gel for various genotypes of 
apolipoprotein E. 

[0066] FIGURE 22A is a chart showing the molecular weights of the 91, 83, 72, 48 and 35 base pair fragments 
obtained by restriction enzyme cleavage of the E2, E3 and E4 alleles of apolipoprotein E. 

[0067] FIGURE 22B is the mass spectrum of the restriction product of a homozygous E4 apolipoprotein E genotype. 
[0068] FIGURE 23A is the mass spectrum of the restriction product of a homozygous E3 apolipoprotein E genotype. 
[0069] FIGURE 23B Is the mass spectrum of the restriction product of a E3/E4 apolipoprotein E genotype. 
[0070] FIGURE 24 Is an autoradlograph of Example 5 of a 7.5% polyacryiamide gel In which 10% (5uJ) of each 
amplified sample was loaded: sample M: pBR322 Alul digested; sample 1 : HBV positive in serological analysis; sample 
2: also HBV positive; sample 3 : without serological analysis but with an increased level of transaminases, indicating 
liver disease; sample 4 : HBV negative containing HCV; sample 5 : HBV posit-) negative control; (+) positive control). 
Staining was done with ethidium bromide. 

£0071] FIGU RE 25A is a mass spectrum of sample 1 , which is HBV positive. The signal at 20754 Da represents the 
HBV related amplification product (67 nucleotides, calculated mass: 20735 Da). The mass signai at 10390 Da repre- 
sents the [M+2Hp molecule ion (calculated: 1 0378 Da). 

[0072] FIGURE 25B is a mass spectrum of sample 3, which is HBV negative corresponding to nucleic acid (Le., 
PCR), serological and dot blot based assays. The amplified product is generated only in trace amounts. Nevertheless 
it is unambiguously detected at 20751 Da (calculated mass: 20735 Da). The mass signal at 10397 Da represents the 
[M+2HJ 2 * molecule ion (calculated: 10376 Da). 

[0073] FIGURE 25C is a mass spectrum of sample 4, which is HBV negative, but HCV positive. No HBV specific 
signals were observed. 

[0074] FIGURE 26 shows a part of the E. cofi lad gene with binding sites of the complementary oligonucelotldes 
used in the ligase chain reaction (LCR) of Example 6. Here the wildtype sequence is displayed. The mutant contains 
a point mutation at bp 1 91 which is also the site of ligation (bold) . The mutation is a C to T transition (G to A, respectively). 
This leads to a T-G mismatch with oligo B (and A-C mismatch with oligo C, respectively). 

[0075] FIGURE 27 is a 7.15% polyacryiamide gel of Example 6 stained with ethidium bromide. M: chain length 
standard (pUC19DNA, Msp\ digested). Lane 1 : LCR with wildtype template. Lane 2: LCR with mutant template. Lane 
3: (control) LCR without template. The ligation product (50 bp) was only generated in the positive reaction containing 
wildtype template. 
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[0076 J FIGURE 28 is an HPLC chromatogram of two pooled positive LCRs. 

(0077J FIGURE 29 shows an HPLC chromatogram the same conditions but mutant template were used. The small 
signal of the ligation product is due to either template-free ligation of the educts or to a ligation at a (G-T, A-C) mismatch. 
The 'false positive' signal is significantly lower than the signal of ligation product with wlldtype template depicted in 
Figure 28. The analysts of ligation educts leads to 'double-peaks* because two of the oligonucleotides are 5'-phospho- 
rylated. 

[0078] FIGURE 30 In (b) the complex signal pattern obtained by MALDI-TOF-MS analysis of Pfu DNA-ligase solution 
of Example 6 is depicted. In (a) a MALDI-TOF-spectrum of an unpurified LCR is 6hown. The mass signal 67569 Da 
probably represents the Pfu DNA ligase. 

[0079] FIGURE 31 shows a MALDI-TOF spectrum of two pooled positive LCRs (a). The signal at 7523 Da represents 
unligated oligo A (calculated: 7521 Da) whereas the signal at 15449 Da represents the ligation product (calculated: 
15450 Da). The signal at 3774 Da is the [M + 2HJ 2+ signal of ollgo A. The signals in the mass range lower than 2000 
Da are due to the matrix ions. The spectrum corresponds to lane 1 in figure 27 and the chromatogram in figure 28. In 
(b) a spectrum of two pooled negative LCRs (mutant template) Is shown. The signal at 7517 Da represents oligo A 
(calculated: 7521 Da). 

[0080] FIGURE 32 shows a spectrum of two pooled control reactions (with salmon sperm DNA as template). The 
signals in the mass range around 2000 Da are due to Tween20, only oligo A could be detected, as expected. 
[0081J FIGURE 33 shows a spectrum of two pooled positive LCRs (a). The purification was done with a combination 
of ultrafiltration and streptavidin DynaBeads as described in the text. The signal at 15448 Da represents the ligation 
product (calculated: 15450 Da). The signal at 7527 represents oligo A (calculated: 7521 Da). The signals at 3761 Da 
is the [M+2H] 2+ signal of oligo A, whereas the signal at 5140 Da is the [M+3Hp + signal of the ligation product. In (b) 
a spectrum of two pooled negative LCRs (without template) is shown. The signal at 7514 Da represents oligo A (cal- 
culated: 7521 Da). 

[0082] FIGURE 34 is a schematic presentation of the oligo base extension of the mutation detection primer as de- 
scribed in Example 7, using ddTTP (A) orddCTP (B) In the reaction mix, respectively. The theoretical mass calculation 
is given In parenthesis. The sequence shown is part of the exon 1 0 of the CFTR gene that bears the most common 
cystic fibrosis mutation AF508 and more rare mutations AI507 as well as He506Ser. 

[0083] FIGURE 35 is a MALDI-TOF-MS spectrum recorded directly from precipitated oligo base extended primers 
for mutation detection. The spectrum in (A) and (B), respectively show the annealed primer (CF508) without further 
extension reaction. Panel C displays the MALDI-TOF spectrum of the wild type by using pppTdd in the extension 
reaction and D a heterozygotic extension products carrying the 506S mutation when using pppCdd as terminator. 
Panels E and F show a heterozygote with AF508 mutation with pppTdd and pppCdd as terminators in the extension 
reaction. Panels G and H represent a homozygous AF508 mutation with either pppTdd or pppCdd as terminators. The 
template of diagnosis is pointed out below each spectrum and the observed/expected molecular mass are written In 
parenthesis. 

[0084] FIGURE 36 shows the portion of the sequence of pRFd DNA, which was used as template for nucleic acid 
amplification in Example 8 of unmodified and 7-deazapurine containing 99-mer and 200-mer nucleic acids as well as 
the sequences of the 1 9-mer forward primer and the two 18-mer reverse primers. 

[0085] FIGURE 37 shows the portion of the nucleotide sequence of M13mp1 8 RFI DNA, which was used in Example 
8 for nucleic acid amplification of unmodified and 7-deazapurine containing 1 03-mer nucleic acids. Also shown are 
nucleotide sequences of the 1 7-mer primers used in the nucleic acid amplification reaction. 

[0086] FIGURE 38 shows the result of a pofyacrylamide gel electrophoresis of amplified products described in Ex- 
ample 8 purified and concentrated for MALDI-TOF MS analysis. M: chain length marker, lane 1 : 7-deazapurine con- 
taining 99-mer amplified product, lane 2: unmodified 99-mer, lane 3: 7-deazapurine containing 1 03-mer and lane 4: 
unmodified 1 03-mer amplified product 

[0087] FIGURE 39: an autoradlogram of polyacrylamide gel electrophoresis of nucleic acid (Le., PCR) reactions 
carried out with S'-f^PJ-labeled primers 1 and 4. Lanes 1 and 2: unmodified and 7-deazapurine modified 1 03-mer 
amplified product (53321 and 23520 counts), lanes 3 and 4: unmodified and 7-deazapurine modified 200-mer (71123 
and 39582 counts) and lanes 5 and 6: unmodified and 7-deazapurine modified 99-mer (173216 and 94400 counts). 
[0088] FIGURE 40 a) MALDI-TOF mass spectrum of the unmodified 1 03-mer amplified products (sum of twelve 
single shot spectra). The mean value of the masses calculated for the two single strands (31768 u and 31759 u) is 
31763 u. Mass resolution: 18. b) MALDI-TOF mass spectrum of 7-deazapurine containing 103-mer amplified product 
(sum of three single shot spectra). The mean value of the masses calculated for the two single strands (31 727 u and 
31 71 9 u) Is 31 723 u. Mass resolution: 67. 

[0089] FIGURE 41 : a) MALDI-TOF mass spectrum of the unmodified 99-mer amplified product (sum of twenty single 
shot spectra). Values of the masses calculated for the two single strands: 30261 u and 30794 u. b) MALDI-TOF mass 
spectrum of 7-deazapurine containing 99-mer amplified product (sum of twelve single shot spectra). Values of the 
masses calculated for the two single strands: 30224 u and 30750 u. 
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[0090] FIGURE 42: a) MALDI-TOF mass spectrum of the unmodified 200-mer amplified product (sum of 30 single 
shot spectra). The mean value of the masses calculated for the two single strands (61 873 u and 61595 u) is 61 734 u. 
Mass resolution: 28. b) MALDI-TOF mass spectrum of 7-deazapurine containing 200-mer amplified product (sum of 
30 single shot spectra). The mean value of the masses calculated for the two single strands (61 772 u and 61 71 4 u) is 

5 61643 u. Mass resolution: 39. 

[0091] FIGURE 43: a) MALDI-TOF mass spectrum of 7«deazapurine containing 100-mer amplified product with ri- 
bomodified primers. The mean value of the masses calculated for the two single strands (30529 u and 31095 u) is 
30812 u. b) MALDI-TOF mass spectrum of the amplified product after hydrolytic primer-cleavage. The mean value of 
the masses calculated for the two single strands (25104 u and 25229 u) is 251 67 u. The mean value of the cleaved 

10 primers (5437 u and 5918 u) is 5677 u. 

[0092] FIGURE 44 A-D shows the MALDI-TOF mass spectrum of the four sequencing ladders obtained from a39-mer 
template (SEQ ID No. 23), which was immobilized to streptavldin beads via a 3' biotinylation. A 14-mer primer (SEQ 
ID NO. 24) was used in the sequencing according to Example 9. 

[0093] FIGURE 45 shows a MALDI-TOF mass spectrum of a solid phase sequencing of a 7B-mer template (SEQ ID 
*s No. 25), which was immobilized to streptavldin beads via a 3' biotinylation. A 18-mer primer (SEQ ID No. 26) and 
ddGTP were used in the sequencing. 

[0094] FIGURE 46 shows a scheme in which duplex DNA probes with single-stranded overhang capture specific 
DNA templates and also serve as primers for solid phase sequencing. 

[0095] FIGURE 47 A-D shows MALDI-TOF mass spectra obtained from a sequencing reaction using 5* fluorescent 
20 labeled 23-mer (SEQ ID No. 29) annealed to a 3' blotlnylated 18-mer (SEQ ID No. 30), leaving a 5-base overhang, 
which captured a 15-mer template (SEQ ID No. 31) as described In Example 9. 

[0096] FIGURE 48 shows a stacking fluorogram of the same products obtained from the reaction described in FIG- 
URE 47, but run on a conventional DNA sequencer. 

[0097] FIGURE 49 shows a MALDI-TOF mass spectrum of the sequencing ladder using cycle sequencing as de- 
zs scribed in Example 1 generated from a biological amplified product as template and a 1 2mer (5*-TGC ACC TGA CTC- 
3' (SEQ ID NO. 34)) sequencing primer. The peaks resulting from depurinations and peaks which are not related to 
the sequence are marked by an asterisk. MALDI-TOF MS measurements were taken on a reflectron TOF MS. A.) 
Sequencing ladder stopped with ddATP; B.) Sequencing ladder stopped with ddCTP; C.) Sequencing ladder stopped 
with ddGTP; D.) Sequencing ladder stopped with ddTTP. 
30 [0098] FIGURE 50 shows a schematic representation of the sequencing ladder generated in Fig. 49 with the corre- 
sponding calculated molecular masses up to 40 bases after the primer. For the calculation, the following masses were 
used: 3581 4Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for dCTP and 328.2 Da for 
7-deaza-dGTP. 

[0099] FIGURE 51 shows the sequence of the amplified 209 bp amplified product within the p-globin gene, which 
55 was used as a template for sequencing. The sequences of the appropriate amplification primer and the location of the 
12mer sequencing primer is also shown. This sequence represents a homozygote mutant at the position 4 bases after 
the primer. In a wildtype sequence this T would be replaced by an A. 

[0100] FIGURE 52 shows a sequence which is part of the intron 5 of the interferon-receptor gene that bears the 
AluVpA polymorphism as further described in Example 11. The scheme presents the primer oligo base extension 
40 (PROBE) using ddGTP, ddCTP, or both for termination, respectively. The polymorphism detection primer (IFN) Is un- 
derlined, the termination nucleotides are marked in bold letters. The theoretical mass values from the alleles found in 
28 unrelated individuals and a five member family are given in the table. Both second site mutations found In most 13 
units allele, but not all, are indicated. 

[0101] FIGURE 53 shows the MALDI-TOF-MS spectra recorded directly form precipitated extended cyclePROBE 
43 reaction products. Family study using AluVpA polymorphism In intron 5 of the interferon-a receptor gene (Example 11 ). 
[0102] FIGURE 54 shows the mass spectra from PROBE products using ddC as termination nucleotide in the reaction 
mix. The allele with the molecular mass of approximately 11 650 da from the DNA of the mother and child 2 is a hint to 
a second site mutation within one of the repeat units. 

[0103] FIGURE 55 shows a schematic presentation of the PROBE method for detection of different alleles in the 
so poryT tract at the 3'-end of intron 8 of the CFTR gene with pppCdd as terminator (Example 11 ). 

[0104] FIGURE 56 shows the MALDI-TOF-MS spectra recorded directly from the precipitated extended PROBE 
reaction products. Detection of all three common alleles of the polyT tract at the 3* end of Intron 8 of the CFTR gene, 
(a) T5/T9 heterozygous, (b) T7/T9 heterozygous (Example 11). 

[0105] FIGURE 57 shows a mass spectrum of the digestion of a 252-mer ApoE gene amplified product (E3/e3 
53 genotype) as described in Example 12 using a) Cfot alone and b) Cfol plus Rsal. Asterisks: dep urination peaks. 

[0106] FIGURE 58 shows a mass spectrum of the ApoE gene amplified product (e3/G3 genotype) digested by Cfol 
and purified by a) single and b) double ethanol/glycogen and c) double Isopropyl alcohol/glycogen precipitations. 
[0107] FIGURE 59 shows a mass spectrum of the Cfol/Rsal digest products from a) G2/e3. b) e3/e3, c) G3/e4, 
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and d) e4/e4 genotypes. Dashed lines are drawn through diagnostic fragments. 

[0108] FIGURE 60 shows a scheme for rapid identification of unknown ApoE genotypes following simultaneous 
digestion of a 252-mer apo E gene amplified product by the restriction enzymes Cfol and Rsal. 
[0109] FIGURE 61 shows the multiplex (codons 112 and 158) mass spectrum PROBE results for a) G2/G3, b) E3/e3, 
c) G3/G4, and d) G4/G4 genotypes. E: extension products; P: unextended primer. Top: codon 112 and 158 regions, 
with polymorphic sites bold and primer sequences underlined. 

[01 10] FIGURE 62 shows a mass spectrum of a TRAP assay to detect telomerase activity (Example 13). The spec- 
trum shows two of the primer signals of the amplified product TS primer at 5,497.3 Da (calc. 5623 Da) and the biotl- 
nylated bioCX primer at 7,537.6 Da (calc. 7,537 Da) and the first telom erase-specif ic assay product containing three 
telomeric repeats at 12,775.8 Da (calc. 1 2,452 Da) its mass is largerby one dA nucleotide (1 2,765 Da) due to extendase 
activity of Taq DNA polymerase. 

[01 1 1 ] FIGURE 63 depicts the higher mass range of FIGURE 62, i.e. the peak at 1 2,775.6 Da represents the products 
with these telomeric repeats. The peaks at 20,322.1 Da is the result of a telomerase activity to form seven telomeric 
repeats (calc. 20,395 Da including the extension by one dA nucleotide). The peaks marked 1 , 2, 3 and 4 contain a four 
telomeric repeats at 14,674 Da as well as secondary ion product. 

[01 12] FIGURE 64 displays a MALDI-TOF spectrum of the RTF-amplified product of the human tyrosine hydroxylase 
mRNA indicating the presence of neuroblastoma ceils (Example 14). The signal at 18,763.8 Da represents the non- 
biotinylated single-stranded 61 mer of the nested amplified product (calc. 18,758.2 Da). 

[0113] FIGURE 65 (a) shows a schematic representation of a PROBE reaction for the RET proto-oncogene with a 
mixture of dATP, dCTP, dGTP, and ddTTP (Example 15). B represents blotin, through which the sense template strand 
is bound through streptavidin to a solid support. Figure 65(b) shows the expected PROBE products for ddT and ddA 
reactions for wildtype, C->T, and C-»A antisense strands. 

[01 14] FIGURE 66 shows the PROBE product mass spectra for (a) negative control, (b) Patient 1 being heterozygote 
(VWC->T) and (c) Patient 2 being heterozygote (Wt/C->A), reporting average M r values. 

[0115] FIGURE 67 shows the MALDI-FTMS spectra for synthetic analogs representing ribo-cleaved RET proto- 
oncogene ampiified products from (a) wildtype, (b) G->A, and (c) G-»T homozygotes, and (d) wildtype/G-»A. (e) 
wildtype/G->T, and (f) G-»A/G-»T heterozygotes, reporting masses of most abundant isotope peaks. 
[0116] FIGURE 68 is a schematic representation of nucleic acid immobilization via covalent Afunctional trityl linkers. 
[0117] FIGURE 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers. 
[0118] FIGURE 70 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing 
bound oligo (S'-iminobiotin - TGC ACCTGACTC , SEQ ID NO. 56). An internal standard (CTGTGGTCGTGC, SEQ ID 
NO. 57) was included in the matrix. 

[011 9) FIGURE 71 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing 
bound oligo (5'-iminobiotin - TGCACCTGACTC , SEQ ID NO. 56). An internal standard (CTGTGGTCGTGC, SEQ ID 
NO. 57) was included in the matrix. 

[0120] FIGURE 72 schematicaliy depicts the steps Involved with the Loop-primer oligo base extension (Loop-probe) 
reaction. 

[0121 ] FIGURE 73A shows a MALDI-TOF mass spectrum of a supernatant after Cfol digest of a stem loop. Figure 
73B-D show MALDI-TOF mass spectrum of different genotypes: HbA the wildtype genotype (74B), HbC, a mutation 
of codon 6 of the p-globin gene which causes sickle cell disease (74C), and HbS, a different mutation of codon 6 of 
the p-globin gene which causes sickle cell disease (74 D). 

[0122] FIGURE 74 shows the nucleic acid sequence of the amplified region of CKR-5. The underlined sequence 
corresponds to the region homologous to the amplification primers. The dotted region corresponds to the 32 bp deletion. 
[0123] FIGURE 75 shows the sense primer ckrT7f. Being designed to facilitate binding of T7-RNA polymerase and 
amplification of the CKR-5 region to be analyzed, it starts with a randomly chosen sequence of 24 bases, the T7 
promoter sequence of 18 bases and the sequence homologous to CKR-5 of 19 bases. 

[0124J FIGURE 76 is a MALDI-TOF mass spectrum of the CKR-5 amplification product, which was generated as 
described in the following Example 21 . 

[0125] FIGURE 77 is a positive ion UV-MALDI mass spectra of a synthetic RNA 25-mer (S'-UCCGGUCUGAU- 
GAGUCCGUGAGGAC-3' SEQ ID NO. 62) digested with selected RNAses. For each enzyme 0.6ui allquots of ten 4.5uJ 
assay containing a total of ca. 20 pmol of the RNA were fixed with 1 .5uJ matrix (3-HPA) for analysis. Fragments with 
retained S'-terminus are marked by different arrows, specific for the different RNAses, (Hahner et aL, Proceedings of 
the 44 th ASMS Conference on Mass Spectrometry and Allied Topics, p. 983 (1 996)). 

[0126] FIGURE 78 Is an investigation of the specificity of the RNAses CLj and Cusativin by positive ion UV-MALDI 
mass spectra of a synthetic RNA 20-mer. Expected and/or observed cleavage sites are indicated by arrows. A, B, C 
indicate correct cleavage sites and corresponding singly cleaved fragments. Missing cleavages are designated by a 
question mark (?), unspecific cleavages by an X. 

[01 27] FIGURE 79 shows the separation of a mixture of DNA molecules (1 2-mer, 5'-biot. 1 9-mer, 22-mer and 5'-biot. 
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27-mer) with streptavidin -coated magnetic beads, a) positive ion UV-MALDI mass spectrum of 0.6ut of a mixture con- 
taining ca, 2-4 pmol of each species mixed with 1 ,5uJ matrix (3-HPA). b) same as a) but incubation of the mixture with 
magnetic beads and subsequent release of the captured fragments. 

[0128] FIGURE 80 Elution of immobilized 5* biotinylated49 nt in Wfro transcript from the streptavidin-coated magnetic 
beads. Positive UV-MALDI mass spectrum of the transcript prior to incubation with the magnetic beads (a). Spectra of 
the immobilized RNA transcript after elution with 95% formamide alone (b) and with various additives such as 10mM 
EDTA (c), 10mM CDTA (d) and 25% ammonium hydroxide (e); EOTA and CDTA were adjusted with 25% ammonium 
hydroxide to a pH of 8. 

[0129] FIGURE 81 Positive UV-MALDI mass spectra of the 5' biotinylated 49 nt in vitro transcript after RNAse U 2 
digest for 15 minutes, a) Spectrum of the 25 ul assay containing ca. 100 pmol of the target RNA before separation; b) 
spectrum after isolation of the 5-biotinylated fragments with magnetic beads. Captured fragments were released by a 
solution of 95% formamide containing 1 0 mM CDTA. 1 ul aliquots of the samples were mixed with 1 .5 ul matrix (3-HPA) 
in both cases. 

[0130] FIGURE 82 schematically depicts detection of putative mutations in the human P-giobin gene at codon 5 and 
6 and at codon 30, and the IVS-1 donor site, respectively, done in parallel. FIGURE 82A shows amplification of genomic 
DNA using the primers 02 and P11 . The location of the primers and identification tags as well as an indication of the 
wild type and mutant sequences are shown. FIGURE 82B shows analysis of both sites in a simple Primer Reaction 
Oligo Base Extension (PROBE) using primers p-TAG1 (which binds upstream of codon 5 and 6) and 0-TAG2 (which 
binds upstream of codon 30 and the IVS-1 donor site). Reaction products are captured using streptavidin-coated par- 
amagnetic particle bound biotinylated capture primers (cap-tag- 1 and cap-tag-2, respectively), that have 6 bases at 
the 5* end that are complementary to the 5* end of 0-TAG1 and 0-TAG2, respectively, and a portion which binds to a 
universal primer. 

[0131] FIGURE 83 shows a mass spectrum of the PROBE products of a DNA sample from one individual analyzed 
as described schematically in FIGURE 82. 

[0132] FIGURE 84 shows a mass spectrum of the sequence bound to cap-tag-2. 

[0133] FIGURE 85 shows a mass spectrum obtained by using the 0-TAG1 and p-TAG 2 primers in one sequencing 

reaction using ddATP for termination and then sorting according to the method depicted in FIGURE 82. 

[0134] FIGURE 86 shows a mass spectrum obtained by using the p-TAG 1 and P-TAG2 primers in one sequencing 

reaction using ddCTP for termination and then sorting according to the method depicted in FIGURE 82. 

[0135] FJGURE 87A shows the wiidtype sequence of a fragment of the chemokine receptor CKR-5 gene with primers 

(bold) used for amplification. The 32 base pair (bp) deletion in the CKR-5 allele is underlined; and the stop nucleotides 

are in italic. In FIGURE 87B, the wiidtype strands are depicted with and without an added Adenosine, their length and 

molecular masses are indicated. FIGURE 87C indicates the same for the 32 bp deletion. FIGURE 87D shows the 

PROBE products for the wiidtype gene and FIGURE 87E shows the mutated allele. 

[01 36) FIGURE 88 shows the amplification products of different unrelated individuals as analyzed by native polyacr- 
ylamide gei electrophoreses (15%) and silver stain. The band corresponding to a wiidtype CKR-5 runs at 75 bp and 
the band from the gene with the deletion at 43 bp. Bands bigger than 75 bp are due to unspecific amplification. 
[0137] FIGURE 89A shows a spectrograph of DNA derived from a heterozygous individual: the peak with a mass of 
23319 Da corresponds to the wiidtype CKR-5 and the peaks with masses of 131 37 Da and 13451 Da to the deletion 
allele with and without an extra Adenosine, respectively. FIGURE 89B shows a spectrograph of DNA obtained from 
the same individual as in FIGURE 89A, but the DNA was treated with T4 DNA polymerase to remove the added Ade- 
nosine. FIGURES 89C and 89D are spectrographs derived from homozygous individuals and in FIGURE 89D, the 
Adenosine has been removed. All peaks with masses lower than 13000 Da are due to multiple charged molecules. 
[0138] FIGURE 90A shows the mass spectrum of the results of a PROBE reaction performed on DNA obtained from 
a heterozygous Individual. FIGURE 90B shows a mass spectrum of the results of a PROBE reaction on a homozygous 
Individual. The peaks with masses of 6604 Da and 6607 Da, respectively correspond to the wiidtype allele, and the 
peak with a mass of 6275 Da to the deletion allele. The primer is detected with a mass of 5673 and 5676 Da, respectively. 
[0139J FIGURE 91 shows a MALDI-TOF MS spectra of a thermocycling primer Oligo Base Extension (tc-PROBE) 
reaction as described in Example 24 using three different templates and 5 different PROBE primers simultaneously in 
one reaction. 

[0140] FIGURE 92 schematically depicts a single tube process for amplifying and sequencing exons 5-8 of the p53 
gene as described in Example 25. The mass spectrum is the A reaction of Figure 93. 

[0141] FJGURE 93 shows a superposition plot of four separate reactions for sequencing a portion of exon 7 of the 
p53 gene as described in Example 25. 

[0142] FIGURE 94 shows the mass spectrum obtained from the A reaction for sequencing a portion of exon 7 of the 
p53 gene as described in Example 25. 

[01 43] FIGURE 95 shows the mass spectrum of a p53 sequencing ladder for which 5nL of each reaction were trans- 
ferred to wells of a chip and measured by MALDI-TOF. 
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[01 44] FIGURE 96A shows a MALDI-TOF mass spectra of a synthetic 50-mer (1 5.34 kDa) mixed with 27-mer TO (non- 
complementary, 8.30 kDa). 

[0145] FIGURE 96B shows a MALDI-TOF mass spectra of a synthetic 50-mer (15.34k Da) mixed with a 27-mer c 
(complementary, 8.34 kDa). The final concentration of each oligonucleotide was 10u.M. The signal at 23.68 kDa in 
Figure 96B corresponds to WC-specific dsDNA. 

[0146] FIGURE 97A shows a MALDI-TOF mass spectrum of Cfol/Rsal digest products of a region of exon 4 of the 
apolipoprotein E gene (E3 genotype), using sample preparation as in Figure 96. 

[0147] FIGURE 97B is the same as Figure 97A, except with samples prepared for MALDI-TOF analysis at 4*C. 
[0148] FIGURE 98 shows a MALDI-TOF mass spectrum of Cfol/Rsal simultaneously double digest products of a 
252 base pair region of exon 4 of the apolipoprotein E gene (e4 genotype), with samples prepared at 4°C. 
[0149] FIGURE 99 shows the mass spectra obatfned on a small population study of 1 5 patients with a 16 element 
array of diagnostic products transferred to a MALDI target using a pintool microdispenser. 

[0150] FIGURE 1 00 is a MALDI mass spectrum of an aliquot sampled after a T, digest of a synthetic 20-mer RNA. 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

[0151] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is com- 
monly understood by one of skill in the art to which this invention belongs. Where permitted the subject matter of each 
of the co-pending patent applications and the patent is herein incorporated in its entirety. 

[0152] As used herein, the term "biological sample" refers to any material obtained from any living source ( e.g. , 
human, animal, plant, bacteria, fungi, protlst, virus). For purposes herein, the biologicaJ sample will typically contain a 
nucleic acid molecule. Examples of appropriate biological samples include, but are not limited to: solid materials (e.g. , 
tissue, cell pellets, biopsies) and biological fluids (e.g. , urine, blood, saliva, amniotic fluid, mouth wash, cerebral spinal 
fluid and other body fluids). 

[0153] As used herein, the phrases "chain-elongating nucleotides' 1 and "chain-terminating nucleotides" are used in 
accordance with their art recognized meaning. For example, for DNA, chain-elongating nucleotides include 2'deoxyri- 
bonucleotides ( e.g. , dATP, dCTP, dGTP and dTTP) and chain -terminating nucleotides include 2', 3'- dideoxy ribonucle- 
otides (e.g. , ddATP, ddCTP, ddGTP, ddTTP). For RNA, chain-elongating nucleotides include ribonucleotides (e.g. , 
ATJP, CTP, GTP and UTP) and chain-terminating nucleotides Include S'-deoxyribonucleotides (e.g. . 3'dA, 3'dC, 3*dG 
and 3'dU). A complete set of chain elongating nucleotides refers to dATP, dCTP, dGTP and dTTP. The term 'nucleotide" 
is also well known in the art. 

[0154] As used herein, nucleotides include nucleoside mono-, di-, and triphosphates. Nucleotides also include mod- 
ified nucleotides such as phosphorothtoate nucleotides and deazapurine nucleotides. A complete set of chain-elon- 
gating nucleotides refers to four different nucleotides that can hybridize to each of the four different bases comprising 
the DNA template. 

[0155] As used herein, the superscript O-i designates i + 1 mass differentiated nucleotides, primers or tags. In some 
instances, the superscript O can designate an unmodified species of a particular reactant, and the superscript i can 
designate the i-th mass-modified species of that reactant. If, for example, more than one species of nucleic acids are 
to be concurrently detected, then I + 1 different mass-modified detector oligonucleotides (D°, D 1 , ... D 1 ) can be used 
to distinguish each species of mass modified detector oligonucleotides (D) from the others by mass spectrometry. 
[0156] As used herein, "multiplexing" refers to the simultaneously detection of more than one anatyte, such as more 
than one (mutated) loci on a particular captured nucleic acid fragment (on one spot of an array). 
[0157] As used herein, the term "nucleic acid" refers to single-stranded and/or double-stranded polynucleotides such 
as deoxyribonucleic acid (DNA), and ribonucleic acid (RNA) as well as analogs or derivatives of either RNA or DNA. 
Also included in the term "nucleic acid" are anaiogs of nucleic acids such as peptide nucleic acid (PNA), phospho- 
rothtoate DNA, and other such analogs and derivatives. 

[0158] As used herein, the term "conjugated" refers stable attachment, preferably ionic or covaient attachment. 
Among preferred conjugation means are: streptavldin- or avidin- to biotln Interaction; hydrophobic interaction; magnetic 
interaction (e.g. , using functionalized magnetic beads, such as DYNABEADS, which are streptavi din-coated magnetic 
beads sold by Dynal, Inc. Great Neck, NY and Oslo Norway); polar interactions, such as "wetting" associations between 
two polar surfaces or between oligo/polyethylene glycol; formation of a covaient bond, such as an amide bond, disulfide 
bond, thioether bond, or via crosslinkJng agents; and via an acid-labile or photocleavable linker. 
[0159] As used herein equivalent, when referring to two sequences of nucleic acids means that the two sequences 
in question encode the same sequence of amino acids or equivalent proteins. When "equivalent" is used in referring 
to two proteins or peptides, it means that the two proteins or peptides have substantially the same amino acid sequence 
with only conservative amino acid substitutions that do not substantially alter the activity or function of the protein or 
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peptide. When "equivalent - refers to a property, the property does not need to be present to the same extent [e.g. , two 
peptides can exhibit different rates of the same type of enzymatic activity], but the activities are preferably substantially 
the same. "Complementary, " when referring to two nucleotide sequences, means that the two sequences of nucleotides 
are capable of hybridizing, preferabry with less than 25%, more preferabry with less than 15%, even more preferably 
with less than 5%, most preferably with no mismatches between opposed nucleotides. Preferably the two molecules 
will hybridize under conditions of high stringency. 

[0160] As used herein: stringency of hybridization in determining percentage mismatch are those conditions under- 
stood by those of skill in the art and typically are substantially equivalent to the following: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65*C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1 .0 x SSPE, 0.1% SDS, 50°C 

It is understood that equivalent stringencies may be achieved using alternative buffers, salts and temperatures. 
[0161] As used herein, a primer when set forth in the claims refers to a primer suitable for mass spectrometry methods 
requiring immobilizing, hybridizing, strand displacement, sequencing mass spectrometry refers to a nucleic acid must 
be of low enough mass, typically about 70 nucleotides or less than 70, and of sufficient size to be useful in the mass 
spectrometry methods described herein that rely on mass spectrometry detection. These methods Include primers 
for detection and seequening of nucleic acids, which require a sufficient number nucleotides to from a stable duplex, 
typically about 6-30, preferably about 10-25, more preferably about 12-20. Thus, for purposes herein a primer will be 
a sequence of nucleotides comprising about 6-70, more preferably a 12-70, more preferabry greater than about 14 to 
an upper limit of 70, depending upon sequence and application of the primer. The primers herein, for example for 
mutational analyses, are selected to be upstream of loci useful for diagnosis such that when performing using sequenc- 
ing up to or through the site of interest, the resulting fragment is of a mass that sufficient and not too large to be detected 
by mass spectrometry. For mass spectrometric methods, mass tags or modifier are preferabry included at the 5'-end, 
and the primer is otherwise unlabeled. 

[0162] As used herein, "conditioning" of a nucleic acid refers to modification of the phosphodiester backbone of the 
nucleic acid molecule (e.g. , cation exchange) for the purpose of eliminating peak broadening due to a heterogeneity 
in the cations bound per nucleotide unit. Contacting a nucleic acid molecule with an alkylating agent such as akyliodlde, 
iodoacetamide, p-iodoethanol, or2,3-epoxy-1 -propanol,the monothio phosphodiester bonds of a nucleic acid molecule 
can be transformed into a phosphotriester bond. Likewise, phosphodiester bonds may be transformed to uncharged 
derivatives employing trialkylsrlyl chlorides. Further conditioning involves incorporating nucleotides that reduce sensi- 
tivity for depurination (fragmentation during MS) ejj., a purine analog such as N7- or N9-deazapurine nucleotides, or 
RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate functions that are alkylated 
or employing oligonucleotide mimetics such as peptide nucleic acid (PNA). 

[0163] As used herein, substrate refers to an insoluble support onto which a sample is deposited according to the 
materials described herein. Examples of appropriate substrates include beads (e.g., silica gel, controlled pore glass, 
magnetic, agaroase gele and crosslinked dextroses (jj^ Sepharose and Sephadex, cellulose and other materials 
known by those of skill in the art to serve as solid support matrices. For examples substrates may be formed from any 
or combltlons of: silica gel, glass, magnet, polystyrene/1% dlvinylbenzene resins, such as Wang resins, which are 
Fmoc-amino acld-4-(hydroxymethyl)phenoxymethylcopoly(styrene-1 % divinylbenzene (DVD)) resin, chlorotrityl 
(2-chlorotritylchloride copolystyrene-DVB resin) resin, Merrifield (chloromethylated copofystyrene-DVB) resin metal, 
plastic, cellulose, cross-linked dextrans, such as those sold under the tradename Sephadex (Pharmacia) and agarose 
gel, such as gels sold under the tradename Sepharose (Pharmacia), which is a hydrogen bonded porysaccharide-type 
agarose gel, and other such resins and solid phase supports known to those of skill in the art. The support matrices 
may be in any shape or form, including, but not limited to: capillaries, flat supports such as glass fiber filters, glass 
surfaces, metaJ surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials including multiwell plates 
or membranes (e.g., of polyethylene, polypropylene, poryamide, polyvinylidenedifluoride), pins (e.g., arrays of pins 
suitable for combinatorial synthesis or analysis or beads in pits of flat surfaces such as wafers (e.g., silicon wafers) 
with or without plates, and beads. 

[0164] As used herein, a selectively cleavable linker is a linker that is cleaved under selected conditions, such as a 
photocleavable linker, a chemically cleavable linker and an enzymatically cleavable linker (i.e., a restriction endonu- 
cleaso site or a ribonucIeotide/RNase digestion). The linker Is interposed between the support and immobilized DNA. 

Isolation of nucleic acids molecules 

[0165] Nucleic acid molecules can be isolated from a particular biological sample using any of a number of proce- 
dures, which are well-known In the art, the particular isolation procedure chosen being appropriate for the particular 



15 



EP 1 164 203 A2 



biological sample. For example, freeze-thaw and alkaline lysis procedures can be useful for obtaining nucleic acid 
molecules from solid materials; heat and alkaline lysis procedures can be useful for obtaining nucleic acid molecules 
from urine; and proteinase K extraction can be used to obtain nucleic acid from blood (see, ej^, Rolff et aL (1994) 
PGR: Clinical Diagnostics and Research, Springer). 

[0166] To obtain an appropriate quantity of a nucleic acid molecules on which to perform mass spectrometry, ampli- 
fication maybe necessary. Examples of appropriate amplification procedures for use herein include: cloning (Sambrook 
et aL, Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory Press, 1989), polymerase chain reaction 
(PCR) (C.FL Newton and A. Graham, PCR, BIOS Publishers, 1 994), ligase chain reaction (LCR) (see, e^ , Weidmann 
et aL (1994) PCR Methods Appi . Vol. 3, Pp. 57-64; F. Barany (1991) Proc. Natl. Acad. ScL U.S.A. 88:189-93). strand 
displacement amplification (SDA) (see, e.g. , Walker et al, (1994) Nucleic Acids Res. 22:2670-77) and variations such 
as RT-PCR (see, e^g., Higuchl et al (1993) Bio/Technology 11:1026-1030), allele-specific amplification (ASA) and 
transcription based processes. 

Immobilization of nucleic acid molecules to solid supports 

[0167] To facilitate mass spectrometric analysis, a nucleic acid molecule containing a nucleic acid sequence to be 
detected can be immobilized to an insoluble (i.e., a solid) support. Examples of appropriate solid supports include 
beads (e.g. , silica gel, controlled pore glass, magnetic, Sephadex/Sepharose, cellulose), capillaries, flat supports such 
as glass fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials 
including multiweil plates or membranes (e.g. , of polyethylene, polypropylene, poiyamide, polyvinylidenedifluoride), 
P' ns ( e -9 - a^ays of pins suitable for combinatorial synthesis or analysis or beads in pits of flat surfaces such as wafers 
(ejj., silicon wafers) with or without fitter plates. 

[0168] Samples containing target nucleic acids can be transferred to solid supports by any of a variety of methods 
known to those of skill in the art. For example, nucleic acid samples can be transferred to Individual wells of a substrate, 
e^, silicon chip, manually or using a pintool mfcrodispenser apparatus as described herein. Alternatively, a piezoe- 
lectric pipette apparatus can be used to transfer small nanoliter samples to a substrate permitting the performance of 
high throughput miniaturized diagnostics on a chip. 

[0169] Immobilization can be accomplished, for example, based on hybridization between a capture nucleic add 
sequence, which has already been immobilized to the support and a complementary nucleic acid sequence, which is 
also contained within the nucleic acid molecule containing the nucleic acid sequence to be detected (FIGURE 1 A). So 
that hybridization between the complementary nucleic acid molecules is not hindered by the support, the capture nucleic 
acid can include an e.g. , spacer region of at least about five nucleotides in length between the solid support and the 
capture nucleic acid sequence. The duplex formed will be cleaved under the influence of the laser pulse and desorption 
can be initiated. The solid support-bound nucleic acid molecule can be presented through natural oligoribo- or ollgo- 
dooxyribo nucleotide as well as analogs (e.g. , thlo-modified phosphodiester or phosphotriester backbone) or employing 
oligonucleotide mimetics such as PN A analogs (see, eg., Nielsen et a?„ Science 254 :1497 (1991)) which render the 
base sequence less susceptble to enzymatic degradation and -bound capture base sequence. 

Linkers 

[0170] A target detection site can be directly linked to a solid support via a reversible or irreversible bond between 
an appropriate functionality (L*) on the target nucleic acid molecule (T) and an appropriate functionality (L) on the 
capture molecule (FIGURE 1 B). A reversible linkage can be such that it is cleaved under the conditions of mass spec- 
trometry (i.e., a photocleavable bond such as a charge transfer complex or a labile bond being formed between relatively 
stable organic radicals). 

[0171] Photocleavable linkers are linkers that arecJeaved upon exposure to light (see, e^g., Goldmacher et aL (1992) 
Bioconj. Chem. 3:104-107). thereby releasing the targeted agent upon exposure to light. Photocleavable linkers that 
are cleaved upon exposure to light are known (see, e^., Hazum et aL (1981) in Pept., Proc. Eur. Pept. Symp., 16th , 
Brunfeldt, K (Ed), pp. 105-110, which describes the use of a nitrobenzyl group as a photocleavable protective group 
for cysteine; Yen et aL (1989) Makromol. Chem 190 :69-82. which descrtoes water soluble photocleavable copolymers, 
including hydroxypropylmethacryiamide copolymer, glycine copolymer, fluorescein copolymer and methylrhodamine 
copolymer; Goldmacher et aL (1 992) Bioconj. Chem. 3 :104-1 07, which describes a cross-linker and reagent that un- 
dergoes photolytlc degradation upon exposure to near UV light (350 nm); and Senter et al. (1 985) Photochem. Photobiol 
42:231 -237, which describes nitrobenzyloxycarbonyl chloride cross linking reagents that produce photocleavable link- 
ages), thereby releasing the targeted agent upon exposure to light, in preferred embodiments, the nucleic acid is 
immobilized using the photocleavable linker moiety that is cleaved during mass spectrometry. Presently preferred 
photocleavable linkers are set forth in the EXAMPLES. 

[0172] Furthermore, the linkage can be formed with L' being a quaternary ammonium group, In which case, preferably, 
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the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone 
and thus facilitate the desorption required for analysis by a mass spectrometer. Desorption can occur either by the 
heat created by the laser pulse and/or, depending on L* by specific absorption of laser energy which is in resonance 
with the L 4 chromophore. 

[01731 Thus, the L-L' chemistry can be of a type of disulfide bond (chemically cleavable, for example, by mercap- 
toethanol or dithioerythrol), a biotin/streptavldJn system, a heterobifunctional derivative of a trityl ether group (see, e, 
g., Koster et a^ (1 990) "A Versatile Acid-Labile Linker for Modification of Synthetic Biomolecules, " Tetrahedron Letters 
31 :7095) that can be cleaved under mildly acidic conditions as well as under conditions of mass spectrometry, a levulinyl 
group cleavabie under almost neutral conditions with a hydraziniurn/acetate buffer, an arginine-arginine or lysine- lysine 
bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond cleavable by a pyrophosphatase, 
or a ribonucleotide bond in between the oligodeoxy nucleotide sequence, which can be cleaved, for example, by a 
ribonuclease or alkali. 

[0174J The functionalities, L and L,' can also form a charge transfer complex and thereby form the temporary L-L' 
linkage. Since in many cases the "charge-transfer band 1 * can be determined by UV/vIs spectrometry (see, e.g. , Organic 
Charge Transfer Complexes by R. Foster, Academic Press, 1 969), the laser energy can be tuned to the corresponding 
energy of the charge-transfer wavelength and, thus, a specific desorption off the solid support can be Initiated. Those 
skilled in the art will recognize that several combinations can serve this purpose and that the donor functionality can 
be either on the solid support or coupled to the nucleic acid molecule to be detected or vice versa. 
[0175] In yet another approach, a reversible L-L* linkage can be generated by homolytically forming relatively stable 
radicals. Under the influence of the laser pulse, desorption (as discussed above) as well as ionization will take place 
at the radical position. Those skilled In the art will recognize that other organic radicals can be selected and that, in 
relation to the dissociation energies needed to homolytically cleave the bond between them, a corresponding laser 
wavelength can be selected (see e^., Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). 
[0176] An anchoring function L' can also be incorporated into a target capturing sequence (TCS) by using appropriate 
primers during an amplification procedure, such as PCR (FIGURE 4), LCR (FIGURE 5) or transcription amplification 
(FIGURE 6A). 

[01 77} When performing exonuclease sequencing using MALDI-TOF MS, a single stranded DNA molecule immobi- 
lized via its 5-end to a solid support is unilaterally degraded with a 3' -p recessive exonuclease and the molecular weight 
of the degraded nucleotide Is determined sequentially. Reverse Sanger sequencing reveals the nucleotide sequence 
of the immobilized DNA. By adding a selectively cleavable linker, not onJy can the mass of the free nucleotides be 
determined but also, upon removal of the nucleotides by washing, the mass of the remaining fragment can be detected 
by MALDI-TOF upon cleaving the DNA from the solid support Using selectively cleavable linkers, such as the photo- 
cleavable and chemical cleavable linkers provided herein, this cleavage can be selected to occur during the ionization 
and volatizing steps of MALDI-TOF. The same rationale applies for a 5' immobilized strand of a double stranded DNA 
that is degraded while in a duplex. Likewise, this also applies when using a 5 -processive exonuclease and the DNA 
is immobilized through the 3'-end to the solid support. 

[0178J As noted, at least three version of immobilization are contemplated herein: 1) the target nucleic acid is am- 
plified or obtained (the target sequence or surrounding DNA sequence must be known to make primers to amplify or 
isolated); 2) the primer nucleic acid is immobilized to the solid support and the target nucleic acid is hybridized thereto 
(this is for detecting the presence of or sequencing a target sequence in a sample); or 3) a double stranded DNA 
(amplified or isolated) is immobilized through linkage to one predetermined strand, the DNA is denatured to eliminate 
the duplex and then a high concentration of a complementary primer or DNA with identity upstream from the target 
site is added and a strand displacement occurs and the primer is hybridized to the immobilized strand. 
[01 79] In the embodiments where the primer nucleic acid is immobilized on the solid support and the target nucleic 
acid is hybridized thereto, the Inclusion of the cleavable linker allows the primer DNA to be immobilized at the 5'-end 
so that free 3'OH is available for nucleic acid synthesis (extension) and the sequence of the "hybridized" target DNA 
can be determined because the hybridized template can be removed by denaturation and the extended DNA products 
cleaved from the solid support for MALDI-TOF MS. Similarly for 3), the Immobilized DNA strand can be elongated when 
hybridized to the template and cleaved from the support. Thus, Sanger sequencing and primer ollgo base extension 
(PROBE), discussed below, extension reactions can be performed using an immobilized primer of a known, upstreamn 
DNA sequence complementary to an invariable region of a target sequence. The nucleic acid from the person Is ob- 
tained and the DNA sequence of a variable region (deletion, Insertion, missense mutation that cause genetic predis- 
position or diseases, or the presence of viral/bacterial or fungal DNA) not only Is detected, but the actual sequence 
and position of the mutation is also determined. 

[01 80) In other cases, the target DNA must be immobilized and the primer anneaJed. This requires amplifying a larger 
DNA based on known sequence and then sequencing the immobilized fragments (i.e., the extended fragments are 
hybridized but not immobilized to the support as described above). In these cases, it is not desirable to include a linker 
because the MALDI-TOF spectrum Is of the hybridized DNA; it is not necessary to cleave the immobilized template. 
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[0181] Any linker known to those of skill fn the art for Immobilizing nucleic acids to solid supports may be used herein 
to link the nucleic acid lo a solid support. The preferred linkers herein are the selectively cteavable linkers, particularly 
those exemplified herein. Other linkers include, acid cteavable linkers, such as bismaleimideothoxy propane, acid- 
labile trityl linkers. 

s [01 82] Acid cleavable linkers, photodeavable and heat sensitive linkers may also be used, particularly where it may 
be necessary to cleave the targeted agent to permit it to be more readily accessible to reaction. Acid cleavable linkers 
include, but are not limited to, bismaleimideothoxy propane; and adlpic acid dihydrazide linkers (see, ejj., Fattom et 
§L < 1 992) Infection & Immun. 60:584-589) and acid labile transferrin conjugates that contain a sufficient portion of 
transferrin to permit entry into the intracellular transferrin cycling pathway (see, ejj.. Welhoner et al. (1991) J. Biol. 

10 Chem. 266 :4309-4314). 

Photodeavable Linkers 

[01 83] Photodeavable linkers are provided. In particular, photodeavable linkers as their phosphoramidite derivatives 
*5 are provided for use in solid phase synthesis of oligonucleotides. The linkers contain o-nitrobenzyl moieties and phos- 
phate linkages which allow for complete photolytic cleavage of the conjugates within minutes upon UV irradiation. The 
UV wavelengths used are selected so that the irradiation will not damage the oligonucleotides and are preferrably 
about 350-380 nm, more preferably 365 nm. The photodeavable linkers provided herein possess comparable coupling 
efficiency as compared to commonly used phosphoramidite monomers (see, Sinha et aL (1983) Tetrahedron Lett. 24: 
20 5843-5846; Sinha et al (1 984) Nucleic Adds Res. 12:4539-4557; Beaucage et aL (1 993) Tetrahedron 49:61 23-61 94; 
and Matte ucci et aL (1 98 1 ) J. Am. Chem. Soc. 103:31 85-3 1 9 1 ) . 
[0184] In one embodiment, the photodeavable linkers have formula I: 
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30 




35 

where R 20 is (o-(4,4'-dimethoxytrityloxy)alkyl or o-hydroxyalkyi; R 21 is selected from hydrogen, alkyl, aryl, alkoxycar- 
bonyl, aryloxycarbonyl and carboxy; R 22 is hydrogen or (diaJkylamino)(a)-cyanoalkoxy)P-; t Is 0-3; and R 50 is alkyl, 
aikoxy, aryl or aryloxy. 
40 [0185] In a preferred embodiment, the photodeavable linkers have formula II: 



45 



50 




55 

where R 20 is w-(4,4'-dimethoxytrityloxy)alkyl, o>-hydroxyalkyl or alkyl; R 21 is selected from hydrogen, alkyl, aryl, atkox- 
ycarbonyl, aryloxycarbonyl and carboxy; R 22 is hydrogen or (dialkylamino)(a>-cyanoalkoxy)P-; and X 20 is hydrogen, 
alkyl or OR 20 . 
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[0186] In particularly preferred embodiments, R 20 is 3-(4,4 , -dimethoxytrityloxy)propyl, 3-hydroxypropyl or methyl; 
R 21 is selected from hydrogen, methyl and carboxy; R 22 is hydrogen or (diisopropylamino)(2-cyanoethoxy)P-; and X 20 
is hydrogen, methyl or OR 20 . In a more preferred embodiment, R 20 is 3-(4,4'-dimethoxytritytoxy)propyl; R 21 is methyl; 
R 22 is (diisopropylarnino)(2-cyanoethoxy)P-; and X 20 is hydrogen. In another more preferred embodiment, R 20 is me- 
thyl; R 21 is methyl; R 22 is (diisopropylamino)(2-cyanoethoxy)P-; and X 20 is 3'(4,4'-dimethoxytrityloxy)propoxy. 
[0187] In another embodiment, the photocleavable linkers have formula III: 




where R 23 is hydrogen or (diaJkylamino)(<*>-cyanoalkoxy)P-; and R 24 is selected from <o-hydroxyalkoxy, o>-(4,4 , -dimeth- 
oxytrityloxy)alkoxy, o-hydroxyalkyl and o>-(4,4 , -dimethoxytrityloxy)afkyl, and is unsubstftuted or substituted on the alkyl 
or alkoxy chain with one or more alkyl groups; r and s are each independently 0-4; and R 50 is alkyl, alkoxy, aryl or 
aryloxy. In certain embodiments, R 24 is co-hydroxyalkyl or <B-(4,4'-dimethoxytrityloxy)alkyl, and is substituted on the 
alkyl chain with a methyl group. 

[01 88] I n preferred embodiments, R 23 is hydrogen or (diisopropylamino)(2-cyanoethoxy) P-; and R 24 Is selected from 
3-hydroxypropoxy, 3-(4,4'-dimethoxytrityloxy)propoxy, 4-hydroxybutyl, 3-hydroxy-1 -propyl, t-hydroxy-2-propyl, 3-hy- 
droxy-2-methyl-1 -propyl, 2-hydroxyethyl, hydroxymethyl, 4-(4,4'-dimethoxytrrtyloxy)buty1 < 3-(4,4 , -dimethoxytrityloxy)- 
1 -propyl, 2-(4,4 , -dimethoxytrityloxy)ethyl, 1 -(4,4 , -dimethoxytrityloxy)-2-propyl, 3-(4,4 , -dimethoxytriyloxy)-2-methyl- 
1 -propyl and 4,4 , -dimethyoxytrityloxymethyl. 

[0189] In more preferred embodiments, R 23 is (diisopropylarnino)(2-cyanoethoxy)P-; rand s are O; and R 24 is se- 
lected from 3-(4,4 , -dimethoxytrityloxy)propoxy, 4«(4 l 4 , -dimethoxytrityioxy)butyl, 3-(4,4 , -dimethoxytrrtyloxy)propyl, 2- 
(4,4'-dimethoxytrityioxy)ethyl, 1 -(4,4'-dimethoxytrityloxy)-2-propyl, 3-<4,4 , -dimethoxytriyloxy)-2-methyl-1 -propyl and 
4,4'-dimethyoxytrityloxymethyl. R 24 is most preferably 3-(4 1 4 l -dimethoxytrityloxy)propoxy. 

Preparation of the photocleavable linkers 

A. Preparation of photocleavable linkers of formulae I or II 

[0190] Photocleavable linkers of formulae I or II may be prepared by the methods described below, by minor modi- 
fication of the methods by choosing the appropriate starting materials or by any other methods known to those of skill 
in the art. Detailed procedures for the synthesis of photocleavable linkers of formula II are provided in the Examples. 
[0191] In the photocleavable linkers of formula II where X 20 is hydrogen, the linkers may be prepared In the following 
manner. Alkylation of 5-hydroxy-2-nitrobenzaldehyde with an co-hydroxyalkyl hatide, ejj., 3-hydroxypropyl bromide, 
followed by protection of the resulting alcohol as, e.g. , a silyl ether, provides a 5-(a>-sily loxyalkoxy>2-nrtrobenzaldehyde. 
Addition of an organ ometallic to the aldehyde affords a benzyl ic alcohol. Organometallics which may be used include 
triaikylaluminums (for linkers where R 21 Is aikyl), such as trimethylaluminum, borohydrides (for linkers wtiere R 21 is 
hydrogen), such as sodium borohydride, or metal cyanides (for linkers where R 21 is carboxy or alkoxycarbonyl), such 
as potassium cyanide. In the case of the metal cyanides, the product of the reaction, a cyanohydrin, would then be 
hydrolyzed under either acidic or basic conditions in the presence of either water or an alcohol to afford the compounds 
of interest. 

[0192] The silyl group of the side chain of the resulting benzyl ic alcohols may then be exchanged for a 4,4'-dimeth- 
oxytriyl group by desifylation with, ejj., tetrabutytammonium fluoride, to give the corresponding alcohol, followed by 
reaction with 4,4'-dimethoxytrityl chloride. Reaction with, e.g. , 2-cyanoethyl diisopropylchlorophosphoramidite affords 
the linkers where R 22 is (dlalkylamino)(a>-cyanoalkoxy)P-. 

[0193] A specific example of a synthesis of a photocleavable linker of formula II is shown in the following scheme. 
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which also demonstrates use of the linker in oligonucleotide synthesis. This scheme is intended to be illustrative only 
and in no way limits the scope of the invention. Experimental details of these synthetic transformations are provided 
in the Examples. 



HO 
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[0194] Synthesis of the linkers of formula II where X 20 is OR 20 , 3 ,4- dl hydroxy acetophenone is protected selectively 
at the 4-hydroxyJ by reaction with, e.g. , potassium carbonate and a siryl chloride, Benzoate esteres, propiophenones, 
butyrophenones, etc. may be used in place of the acetophenone. The resulting 4-silyloxy-3-hydroxyacetophenone is 
then alkylated at the with an alkyl halide (for linkers where R 20 is alky I) at the 3-hydroxyl and desilylated with, e.g. , 
tetrabuy I ammonium fluoride to afford a 3-alkoxy-4-hydroxy acetophenone. This compound is then alkylated at the 4-hy- 
droxyl by reaction with an co-hydroxyalkyl halide, ejj.. 3-hydroxypropyl bromide, to give a4-(o-hydroxyalkoxy)-3-alkoxy- 
acetophenone. The side chain alcohol is then protected as an ester, ejj.. an acetate. This compound is then nitrated 
at the 5-position with, ej^, concentrated nitric acfd to provide the corresponding 2-nitroacetop hen ones. Saponification 
of the side chain ester with, e.g. , potassium carbonate, and reduction of the ketone with, sodium borohydride, in 
either order gives a 2-nitro-4-(a>-hydroxyaIkoxy)-5-alkoxybenzylic alcohol. 

[0195] Selective protection of the side chain alcohol as the corresponding 4,4 , -dimethoxytrttyl ether is then accom- 
plished by reaction with 4,4'-dimethoxytrityl chloride. Further reaction with, e.g. , 2-cyanoethyl diisopropylchlorophos- 
phoramidite affords the linkers where R 22 is (dialkylamino)(a)-cyanoalkoxy)P-. 

[0196] A specific example of the synthesis of a photocieavable linker of formula li is shown the following scheme. 
This scheme Is intended to be illustrative only and in no way limit the scope of the Invention. Detailed experimental 
procedures for the transformations shown are found in the Examples. 
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B. Preparation of photocleavable linkers of formula III 

[0197] Photocleavable linkers of formula III may be prepared by the methods described below, by minor modification 
of the methods by choosing appropriate starting materials, or by other methods known to those of skill in the art. 
[0198] In general, photocleavable linkers of formula III are prepared from co-hydroxyaJkyl- or alkoxyaryl compounds, 
in particular eo-hydroxy-alkyl or alkoxy-benzenes. These compounds are commercially available, or may be prepared 
from an (o-hydroxyaJkyl hallde (e.g. , 3-hydroxypropyl bromide) and either phenyllithium (for the o-hydroxyalky ben- 
zenes) or phenol (for the o-hydroxyalkoxybenzenes). Acylation of the oo-hydroxyi group (e.g. , as an acetate ester) 
followed by Friedel-Crafts acylation of the aromatic ring with 2-nltrobenzoyl chloride provides a 4-(co-acetoxy-aikyl or 
alkoxy)-2-nitrobenzophenone. Reduction of the ketone with, e.g. , sodium boro hydride, and saponification of the side 
chain ester are performed In either order to afford a 2-nitrophenyl-4-(hydroxy-alky! or alkoxy)phenylmethanol. Protec- 
tion of the terminal hydroxyl group as the corresponding 4,4'-dimethoxytrityl ether Is achieved by reaction with 4,4'- 
dimethoxytrityl chloride. The benzylic hydroxyl group is then reacted with, e.g. , 2-cyanoethyl diisopropytchlorophos- 
phoramidite to afford linkers of formula II where R 23 is (dialkylamino)(e>-cyanoalkoxy)P-. 

[0199] Other photocleavable linkers of formula III may be prepared by substituting 2-phenyM -propanol or 2-phenyl- 



22 



EP1 164 203 A2 



methyl-1 -propanol for the a>-hydroxy-alkyl or alkoxy-benzenes in the above synthesis. These compounds are commer- 
cially available, but may also be prepared by reaction of t e.g., phenylmagnesium bromide or be nzylmagnesium bromide, 
with the requisite oxlrane (Le., propylene oxide) in the presence of catalytic cuprous ion. 

Chemically cleavable linkers 

[0200] A variety of chemically cleavable linkers may be used to introduce a cleavable bond between th e immobilized 
nucleic acid and the solid support. Acid-labile linkers are presently preferred chemically cleavable linkers for mass 
spectrometry, especially MALDI-TOF MS, because the acid labile bond is cleaved during conditioning of the nucleic 
acid upon addition of the 3-HPA matrix solution. The acid labile bond can be introduced as a separate linker group, e. 
g. , the acid labile trityi groups (see Figure 68; Example 1 6) or may be Incorporated in a synthetic nucleic acid linker by 
introducing one or more silyl intern ucleoside bridges using diisopropylsiryl, thereby forming diisopropylsilyl-linked oli- 
gonucleotide analogs. The diisopropylsiryl bridge replaces the phoshodiester bond in the DNA backbone and under 
mildly acidic conditions, such as 1 .5% trifluoroacetlc add (TFA) or 3-HPA/1% TFA MALDI-TOF matrix solution, results 
In the Introduction of one or more intra-strand breaks in the DNA molecule. Methods for the preparation of diisopro- 
pylsiiyl-linked oligonucleotide precursors and analogs are known to those of skill in the art (see e^, Sana et al. (1 993) 
J. . Org. Chem. 68 :7827-7831 ). These oligonucleotide analogs may be readily prepared using solid state oligonucleotide 
synthesis methods using diisopropylsilyl derivatized deoxyribo nucleosides. 

Nucleic acid conditioning 

[0201] Prior to mass spectrometry analysis, it may be useful to "condition" nucleic acid molecules, for example to 
decrease the laser energy required for volatilization and/or to minimize fragmentation. Conditioning is preferably per- 
formed while a target detection site is immobilized. An example of conditioning is modification of the phosphodiester 
backbone of the nucleic acid molecule ( e.g. , cation exchange), which can be useful for eliminating peak broadening 
due to a heterogeneity in the cations bound per nucleotide unit. Contacting a nucleic acid molecule with an alkylating 
agent such as akyliodide, iodoacetamide, 0-iodoethanol, or 2,3-epoxy-1 -propanol, the monothk) phosphodiester bonds 
of a nucleic acid molecule can be transformed into a phosphotriester bond. Likewise, phosphodiester bonds may be 
transformed to uncharged derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating nu- 
cleotides that reduce sensitivity for depurination (fragmentation during MS) e.g. , a purine analog such as N7- or N9-dea- 
zapurine nucleotides, or RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate func- 
tions which are alkylated or employing oligonucleotide mimetics such as PIMA. 

Multiplex reactions 

[0202] For certain applications, it may be useful to simultaneously detect more than one (mutated) loci on a particular 
captured nucleic acid fragment (on one spot of an array) or it may be useful to perform parallel processing by using 
oligonucleotide or oligonucleotide mimetic arrays on various solid supports. "Multiplexing" can be achieved by several 
different methodologies. For example, several mutations can be simultaneously detected on one target sequence by 
employing corresponding detector (probe) molecules (e.g. , oligonucleotides or oligonucleotide mimetics). The molec- 
ular weight differences between the detector oligonucleotides D1, 02 and D3 must be large enough so that simulta- 
neous detection (multiplexing) is possible. This can be achieved either by the sequence itself (composition or length) 
or by the introduction of mass-modifying functionalities M1 -M3 into the detector oligonucleotide (seel FIGURE 2). 

Mass modification of nucleic acids 

[0203] Mass modifying moieties can be attached, for instance, to eitherthe 5*-end of the oligonucleotide (M 1 ), to the 
nucleobase (or bases) (M 2 , M 7 ), to the phosphate backbone (M 3 ), and to the 2'-position of the nucleoside (nucleosides) 
(M 4 , M 6 ) and/or to the terminal 3*-positlon (M 5 ). Examples of mass modifying moieties include, for example, a halogen, 
an azldo, or of the type, XR, wherein X Is a linking group and R is a mass-modifying functionality. The mass-modifying 
functionality can thus be used to introduce defined mass increments into the oligonucleotide molecule. 
[0204] The mass-modifying functionality can be located at different positions within the nucleotide moiety (see, e.g. , 
U.S. Patent No. 5,647,835 and International PCT application No. WO 94/21822). For example, the mass-modifying 
moiety, M, can be attached either to the nucleobase, M 2 (in case of the c 7 -deazanucleosJdes also to C-7, M 7 ), to the 
triphosphate group at the alpha phosphate, M 3 , or to the 2*-position of the sugar ring of the nucleoside triphosphate, 
M 4 and M 6 . Modifications introduced at the phosphodiester bond (M4), such as with alpha-thio nucleoside triphos- 
phates, have the advantage that these modifications do not interfere with accurate Watson-Crick base-pairing and 
additionally allow for the one-step post-synthetic site-specific modification of the complete nucleic add molecule e.g. , 
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via alkylation reactions (see, e.^ ., Nakamaye et aL (198B) NucL Acids Res, 16 :9947-59). Particularly preferred mass- 
modifying functionalities are boron -modified nucfeic acids since they are better incorporated into nucleic acids by 
polymerases (see, e a. , Porter et aj. (1 995) Biochemistry 34 : 1 1 963-1 1 969; Hasan et al. (1 996) Nucleic Acids Res. 24 : 
21 50-21 57; Li et aL (1 995) Nuci. Acids Res. 23:4495-4501 ). 

5 [0205] Furthermore, the mass-modifying functionality can be added so as to affect chain termination, such as by 
attaching it to the 3-position of the sugar ring in the nucleoside triphosphate, M 5 . For those skilled in the art, it is clear 
that many combinations can be used in the methods provided herein. In the same way, those skilled in the art will 
recognize that chain-elongating nucleoside triphosphates can also be mass-modified in a similar fashion with numerous 
variations and combinations in functionality and attachment positions. 

io [0206] Without being bound to any particular theory, the mass-modification, M, can be introduced for X in XR as well 
as using oligoVpolyethylene glycol derivatives for R. The mass-modifying increment in this case is 44, i.e. five different 
mass-modified species can be generated by just changing m from 0 to 4 thus adding mass units of 45 (m=0), 89 (m = 
1), 133 (m = 2), 177 (m = 3) and 221 (m = 4) to the nucleic acid molecule (e.g. , detector oligonucleotide (D) or the 
nucleoside triphosphates (FIGURE B(C)), respectively). The oligo/poiyethylene glycols can also be monoalkylated by 

'5 a lower alkyl such as methyl, ethyl, propyl, isopropyl, t-butyl and the like. A selection of linking functionalities, X, are 
also illustrated. Other chemistries can be used in the mass-modified compounds (see, e.g. , those described in Oligo- 
nucleotides and Analogues, A Practical Approach , F. Eckstein, editor, IRL Press, Oxford, 1991). 
[0207] In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethytene glycols, 
can be selected and attached via appropriate linking chemistries, X. A simple mass-modification can be achieved by 

2° substituting H for halogens like F, CJ, Br and/or I, or pseudohalogens such as CN, SCN, NCS, or by using different 
alkyl, aryl or aralkyi moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, 
or functional groups such as CH 2 F, CHF 2 , CF 3 , Si(CH 3 ) 3 , Si(CH3) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H5) 3 . Yet another 
mass-modification can be obtained by attaching homo- or heteropeptides through the nucleic acid molecule (e.g. , 
detector (D)) or nucleoside triphosphates. One example, useful in generating mass-modified species wtth a mass 

25 increment of 57. is the attachment of oiigoglycines, e.g. , mass-modifications of 74 (r-t , m=0), 131 (r=1 , m = 1), 188 
(r = 1 , m as 2), 245 (r = 1 , m = 3) are achieved. Simple oligoamides also can be used, e.g. , mass -modifications of 74 
(r = 1, m = 0), 88 (r=2, m = 0), 102 (r=3, m = 0), 116(r=4, m = 0), etc. are obtainable. Variations in additions to those 
set forth herein will be apparent to the skilled artisan. 

[0208] Different mass-modified detector oligonucleotides can be used to simultaneously detect all possible variants/ 
30 mutants simultaneously (FIGURE 6B). Alternatively, all four base permutations at the site of a mutation can be detected 
by designing and positioning a detector oligonucleotide, so that it serves as a primer for a DNA/RN A polymerase with 
varying combinations of elongating and terminating nucleoside triphosphates (FIGURE 6C). For example, mass mod- 
ifications also can be incorporated during the amplification process. 

[0209] FIGURE 3 shows a different multiplex detection format, in which differentiation is accomplished by employing 
35 different specific capture sequences which are position-specifically Immobilized on a flat surface ( e.g. , a 'chip array*). 
If different target sequences Tl-Tn are present, their target capture sites TCS1-TCSn will specifically interact with 
complementary immobilized capture sequences C1-Cn. Detection is achieved by employing appropriately mass dif- 
ferentiated detector oligonucleotides D1-Dn, which are mass modifying functionalities M1-Mn. 

40 Mass spectrometrlc methods for sequencing DNA 

[0210] Amenable mass spectrometry formats for use herein include the ionization (I) techniques, such as matrix 
assisted laser desorption ionization (MALDl), electrospray (ESI) (e.g. , continuous or pulsed); and related methods (e. 
C}., lonspray, Thermospray, Fast Atomic Bombardment), and massive cluster impact (MCI); these ion sources can be 
45 matched with detection formats including lin-llnear fields) time-of-flight (TOF), single or multiple quadrupole, single or 
multiple magnetic sector, Fourier transform ion cyclotron resonance (FTICR), ion trap, or combinations of these to give 
a hybrid detector (e.g. , ion trap - time of flight). For ionization, numerous matrix/wavelength combinations including 
frozen anaryte preparation (MALDl) or solvent combinations (ESI) can be employed. 

[0211] Since a normal DNA molecule includes four nucleotide units (A, T, C, G), and the mass of each of these is 
so unique (monoisotoplc masses 313.06, 304.05, 289.05, 329.05 Da, respectively), an accurate mass determination can 
define or constrain the possible base compositions of that DNA. Only above 4900 Da does each unit molecular weight 
have at least one allowable composition; among all 5-mers there is only one non-unique nominal molecular weight, 
among 8-mers, 20. For these and larger oligonucleotides, such mass overlaps can be resolved with the — 1/1 0 5 (— 
1 0 part per million, ppm) mass accuracy available with high resolution FTICR MS. For the 25-mer AgT^, the 20 com- 
55 position degeneracies when measured at ±0.5 Da is reduced to three (A 5 T 20 , T 4 C 12 G>9, AT 3 C 4 G 16 ) when measured 
with 2 ppm accuracy. Given composition constraints (e.g. , the presence or absence of one of the four bases in the 
strand) can reduce this further (see below). 

[021 2] Medium resolution instrumentation, Including but not exclusively curved field reflectron or delayed extraction 
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time-of-f light MS instruments, can also result in Improved DNA detection for sequencing or diagnostics. Either of these 
are capable of detecting a 9 Da (Am (A-T)) shift in ^30-mer strands generated from, for example primer oligo base 
extension (PROBE), or competitive oligonucleotide single base extension (COSBE), sequencing, or direct detection 
of small amplified products. 

5 

BlomassScan 

[0213] In this embodiment, exemplified in Example 33, two single stranded nucleic acids are individually immobilized 
to solid supports. One support contains a nucleic acid encoding the wild type sequence whereas the other support 

10 contains a nucleic acid encoding a mutant target sequence. Total human genomic DNA is digested with one or more 
restriction endonuclease enzyme resulting in the production of small fragments of double stranded genomic DNA 
(10-1,000 bp). The digested DNA is incubated with the immobilized single stranded nucleic acids and the sample is 
heated to denature the DNA duplex. The immobilized nucleic acid competes with the other genomic DNA strand for 
the complementary DNA strand and under the appropriate conditions, a portion of the complementary DNA strand 
hybridizes to the immobilized nucleic acid resulting in a strand displacement. By using high stringency washing con- 
ditions, the two nucleic acids will remain as a DNA duplex only if there is exact Identity between the immobilized nucleic 
acid and the genomic DNA strand. The DNA that remains hybridized to the immobilized nucleic acid Is analyzed by 
mass spectrometry and detection of a signal in the mass spectrum of the appropriate mass is diagnostic for the wild 
type or mutant allele. In this manner, total genomic DNA can be isolated from a biological sample and screened for 

?o the presence or absence of certain mutations. By immobilizing a variety of single stranded nucleic acids in an array 
format, a panel of mutations may be simultaneously screened for a number of genetic loci (i.e. , multiplexing). 



[0214] In addition, using less stringent washing conditions the hybridized DNA strand may be analyzed by mass 
spectrometry for changes in the mass resulting from a deletion or insertion within the targeted restriction endonuclease 
fragment. 

25 

Primer oligonucleotide base extension 

[0215] As described in detail in the following Example 11, the primer oligo base extension (PROBE) method combined 
with mass spectrometry identifies the exact number of repeat units (Le. the number of nucleotides in homogenous 
30 stretches) as well as second site mutations within a polymorphic region, which are otherwise only detectable by se- 
quencing. Thus, the PROBE technique increases the total number of detectable alleles at a distinct genomic site, 
leading to a higher polymorphism information content (PIC) and yielding a far more definitive identification in for instance 
statistics-based analyses in paternity or forensics applications. 

[0216] The method is based on the extension of a detection primer that anneals adjacent to a variable nucleotide 
35 tandem repeat (VNTR) or a polymorphic mononucleotide stretch using a DNA polymerase in the presence of a mixture 
of deoxyNTPs and those dideoxyNTPs that are not present in the deoxy form. The resulting products are evaluated 
and resolved by M ALDI-TOF mass spectrometry without further labeling of the DNA. In a simulated routine application 
with 28 unrelated individuals, the mass error of this procedure using external calibration was in the worst case 0.38% 
(56-mer), which is comparable to approximately 0.1 base accuracy; routine standard mass deviations are in the range 
to of 0.1% (.03 bases). Such accuracy with conventional electrophoretic methods is not realistic, underscoring the value 
of PROBE and mass spectrometry in forensic medicine and paternity testing. 

[0217] The ultra-high resolution of Fourier Transform mass spectrometry makes possible the simultaneous meas- 
urement of all reactions of a Sanger or Maxam Gilbert sequencing experiment, since the sequence may be read from 
mass differences instead of base counting from 4 tubes. 

45 [0218] Additionally, the mass differences between adjacent bases generated from unilateral degradation in a step- 
wise manner by an exonuclease can be used to read the entire sequence of fragments generated. Whereas UV or 
fluorescent measurements will not discriminate mixtures of the nucleoside/nucleotide which are generated when the 
exonuclease enzyme gets out of phase, this is no problem with mass spectrometry since the resolving power in differ- 
entiating between the molecular mass of dA, dT, dG and dC is more than significant. The mass of the adjacent bases 

50 GdL- nucleotides) can be determined, for example, using Fast Atomic Bombardment (FAB) or Electronspray Ionization 
(ESI) mass spectrometry. 

[0219] New mutation screening over an entire amplified product can be achieved by searching for mass shifted 
fragments generated in an endonuclease digestion as described in detail in the following Examples 4 and 12. 
[0220] Partial sequence information obtained from tandem mass spectrometry (MS") can place composition con- 
55 straints as described in the preceding paragraph. For the 25-mer above, generation of two fragment ions formed by 
collisionally activated dissociation (CAD) which differ by 313 Da discounts T 4 C 12 G9, which contains no A nucleotides; 
confirming more than a single A eliminates AT 3 C 4 G ie as a possible composition. 

[0221] MS n can also be used to determined full or partial sequences of larger DNAs; this can be used to detect, 
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locate, and identify new mutations in a given gene region. Enzymatic digest products whose masses are correct need 
not be further analyzed; those with mass shifts could be isolated in reai time from the complex mixture in the mass 
spectrometer and partially sequenced to locate the new mutation. 

[0222] Table I describes the mutation/polymorphism detection tests that have been developed. 



Table I 



Mutation/Polymorphism Detection Tests 


Clinical Association 


Gene 


Mutation/Polymorphism 


Cystic Fibrosis 


CFTR 


38 disease causing mutations in 14 exons/introns | 


Heart Disease (Cholesterol 
Metabolism) 


Apo E 
Apo A-IV 
Apo B-100 


112R, 112C, 158R, 15BC I 
347S, 347T, 360H, 360Q 
3500Q, 3500R 


Thyroid Cancer 


RET proto- 
oncogene 


C634W, C634T, C634R, 
C634S, C634F 


Sickle Cell Anemia/ Thalassemia 


beta-globin 


Sickle cell anemia S and C 45 thalassemia alleles 


HIV Susceptibility 


CKR-5 


32bp deletion | 


Breast Cancer Susceptibility 


BRCA-2 


2bp (AG) deletion in exon 2 


| Thrombosis 


Factor V 


R506Q 


I Arteriosclerosis 


Gpllla E-selectin 


L33P S128R | 


j Hypertension 


ACE 


l/D polymorphism | 



Detection of mutations 



Diagnosis of genetic diseases 

[0223] The mass spectrometric processes described above can be used, for example, to diagnose any of the more 
than 3000 genetic diseases currently known (e.g. , hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMO), 
Huntington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF)) or to be identified. 
[0224] The following Example 3 provides a mass spectrometric method for detecting a mutation (AF508) of the cystic 
fibrosis transmembrane conductance regulator gene (CFTR), which differs by only three base pairs (900 daltons) from 
the wild type of CFTR gene. As described further in Example 3, the detection is based on a single-tube, competitive 
oligonucleotide single base extension (COSBE) reaction using a pair of primers with the S'-terminal base complemen- 
tary to either the normal or mutant allele. Upon hybridization and addition of a polymerase and the nucleoside triphos- 
phate one base downstream, only those primers properly annealed (i..e, no 3-termfnal mismatch) are extended; prod- 
ucts are resolved by molecular weight shifts as determined by matrix assisted laser desorptlon Ionization tlme-of -flight 
mass spectrometry. For the cystic fibrosis AF508 polymorphism, 28-mer 'normal 1 (N) and 30-mer 'mutant* (M) primers 
generate 29- and 31 -mers for N and M homozygotes, respectively, and both for heterozygotes. Since primer and product 
molecular weights are relatively low (< 1 0 kDa) and the mass difference between these are at least that of a single ~ 
300 Da nucleotide unit, low resolution instrumentation is suitable for such measurements. 

[0225] Thermosequence cycle sequencing, as further described in Example 1t , Is also useful for detecting a genetic 
disease. 

[0226] In addition to mutated genes, which result in genetic disease, certain birth defects are the result of chromo- 
somal abnormalities such as Trisomy 21 (Down's Syndrome), Trisomy 13 (Patau Syndrome), Trisomy 18 (Edward's 
Syndrome), Monosomy X (T urner*s Syndrome) and other sex chromosome aneuploidies such as Klienfe Iter's Syn- 
drome (XXY). Here, "house-keeping" genes encoded by the chromosome in question are present in different quantity 
and the different amount of an amplified fragment compared to the amount In a normal chromosomal configuration can 
be determined by mass spectrometry. 

[0227] Further, there is growing evidence that certain DNA sequences may predispose an Individual to any of a 
number of diseases such as diabetes, arteriosclerosis, obesity, various autoimmune diseases and cancer (e.g. , color- 
ectal, breast, ovarian, lung). Also, the detection of "DNA fingerprints", ejj., polymorphisms, such as "mini- and micro- 
satellite sequences", are useful for determining identity or heredity (e.g. , paternity or maternity). 
[0228] The following Examples 4 and 12 provide mass spectrometer based methods for identifying any of the three 
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different isoforms of human apollpoprotein E, which are coded by the E2, E3 and E4 alleles. For example, the molecular 
weights of DNA fragments obtained after restriction with appropriate restriction endonucleases can be used to detect 
the presence of a mutation and/or a specific allele. 

[0229] Depending on the biological sample, the diagnosis for a genetic disease, chromosomal aneupioidy or genetic 
predisposition can be preformed either pre- or post-natally. 

Diagnosis of cancer 

[0230] Preferred mass spectrometer-based methods for providing an early indication of the existence of a tumor or 
a cancer are provide herein. For example, as described in Example 13, the telomeric repeat amplification protocol 
(TRAP) in conjunction with telomerase specific extension of a substrate primer and a subsequent amplification of the 
telomerase specific extension products by an amplification step using a second primer complementary to the repeat 
structure was used to obtain extension ladders, that were easily detected by MALDI-TOF mass spectrometry as an 
indication of telomerase activity and therefor tumorigenesis. 

[0231] Alternatively, as described in Example 14, expression of a tumor or cancer associated gene (e^g., human 
tyrosine 5-hydroxylase) via RT-PCR and analysis of the amplified products by mass spectrometry can be used to detect 
the tumor or cancer (eg. , biosynthesis of catecholamine via tyrosine 5-hydroxylase is a characteristic of neuroblasto- 
ma). 

[0232] Further, a primer oligo base extension reaction and detection of products by mass spectrometry provides a 
rapid means for detecting the presence of oncogenes, such as the RET proto oncogene codon 634, which is related 
to causing multiple endocrine neoplasia, type II (MEN II), as described in Example 15. 

Diagnosis of infection 

[0233] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are 
different from the sequences contained in the host cell. Detecting or quantitating nucleic acid sequences that are 
specific to the infectious organism is important for diagnosing or monitoring infection. Examples of disease causing 
viruses that infect humans and animals and which may be detected by the disclosed processes include: Retrovlridae 
(eg., human immunodeficiency viruses, such as HIV-1 (also referred to as HTLV-III, LAV or HTLV-III/LAV, see, e.g. , 
Ratner et al. (1985) Nature 313 : 227-284; Wain-Hobson et ah (1985) Cell 40:9-17); HlV-2 (see, Guyader et aL (1 987) 
Nature 328:662-669 European Patent Publication No. 0 269 520; Chakrabarti et aL (1987) Nature 32B :S43-S47; and 
European Patent Application No. 0 655 501); and other isolates, such as HIV-LP (International PCT application No. 
WO 94/00562 entitled "A Novel Human Immunodeficiency Virus* Picomaviridae (eg., polio viruses, hepatitis A virus, 
(see, eg., Gust et al. ( 1 983) Intervirology 20: 1 -7): enteroviruses, human coxsackieviruses, rhinovlruses, echoviruses)! 
Catciyiridae (eg., strains that cause gastroenteritis); Togaviridae (eg., equine encephalitis viruses, rubella viruses)! 
F/awntfaefeg., dengue viruses, encephalitis viruses, yellow fever viruses); Coronaviridae (e.g. , coronaviruses); Rhab- 
dovirfdae (eg., vesicular stomatitis viruses, rabies viruses); Fifovlrfdae (e.g. , ebola viruses); Paramyxovirfdae (e.g. , 
parainfluenza viruses, mumps virus, measles virus, respiratory syncytial virus); Orthomyxoviridae (eg., influenza vi- 
ruses); Bungaviridae (e^g., Hantaan viruses, bunga viruses, phleboviruses and Nairo viruses); Arena viridae (hemor- 
rhagic fever viruses); Reoviridae (ejg., reoviruses, ortwiruses and rotaviruses); Blrnaviridae, Hepadnaviridae (Hepatitis 
B virus); Parvoviridae (parvoviruses); Papovaviridae (papilloma viruses, polyoma viruses); Adenoviridae (most aden- 
oviruses); Herpesvihdae (herpes simplex virus (HSV) 1 and 2, varicella zoster virus, cytomegaovirus (CM V), herpes 
viruses'); Poxvirldae (variola viruses, vaccinia viruses, pox viruses); and indoviridae (e!g. f African swine fever virus); 
and unclassified viruses (e^g., the etiological agents of Spongiform encephalopathies, the agent of delta hepatitis 
(thought to be a defective satellite of hepatitis B virus), the agents of non-A, non-B hepatitis (class 1 = internally trans- 
mitted; class 2 = parenteral^ transmitted (i.e., Hepatitis C); Norwalk and related viruses, and astroviruses). 
[0234] Examples of infectious bacteria include, but are not limited to: Helicobacter pyloris, Borelia burgdorferi, Le- 
gionella pneumophila, Mycobacteria sps (eg., M. tuberculosis, M. avium, M. intracellular, M. kansali, M. gordorme), 
Staphylococcus aureus, Nelssen'a gonorrhoeae, Neisseria meningitidis. Listeria monocytogenes, Streptococcus pyo- 
genes (Group A Streptococcus), Streptococcus agalactia* (Group B Streptococcus), Streptococcus (virfdans group), 
Streptococcus faecalis, Streptococcus bows, Streptococcus (anaerobic sps.), Streptococcus phhenumoniae, patho- 
genic Campylobacter sp., Enterococcussp., Haemophilus influenzae, Bacillus entracis, corynebacterium diphtheria^ 
corynebacteriumsp., Erysipelothrix rhusiopathiae, Clostridium perf ringers, Clostridium tetani, Enterobacteraerogenes, 
Klebsiella pneumoniae, Pasturella muttodda, Bacteroidessp., Fusobacterium nucfeatum, Streptobaciiius moniliformis, 
Treponema pallidium, Treponema pertenue, Leptospira, and Actinomyces israelii. 

[0235] Examples of infectious fungi include: Cryptococcus neoformans, Histopiasma capsulatum, Coccidioides im- 
mitis, Blastomyces dermatitidis, Chlamydia trachomatis, Candida albicans. Other infectious organisms (i.e., protists) 
include: Plasmodium falciparum and Toxoplasma gondii 
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[0236] The processes provided herein makes use of the known sequence information of the target sequence and 
known mutation sites. Although new mutations can also be detected. For example, as shown in FIGURE 8, transcription 
of a nucieic acid molecule obtained from a biological sample can be specifically digested using one or more nucleases 
and the fragments captured on a solid support carrying the corresponding complementary nucleic acid sequences. 
Detection of hybridization and the molecular weights of the captured target sequences provide information on whether 
and where in a gene a mutation is present. Alternatively, DNA can be cleaved by one or more specific endonucteases 
to form a mixture of fragments. Comparison of the molecular weights between wildtype and mutant fragment mixtures 
results in mutation detection. 

Sequencing by generation of specifically terminated f ragements 

[0237] In another embodiment, an accurate sequence determination of a relatively large target nucleic acid, can be 
obtained by generating specificaJly terminated fragments from the target nucleic acid, determining the mass of each 
fragment by mass spectrometry and ordering the fragments to determine the sequence of the larger target nucleic 
acid. In a preferred embodiment, the specifically terminated fragments are partial or complete base-specifically termi- 
nated fragments. 

[0238] One method for generating base specifically terminated fragments involves using a base-specific rtoo nuclease 
after e.g. , a transcription reaction. Preferred base-specific ribonucleases are selected from among: Tj-ribonudease 
(G-specific), U 2 -ribonucleaee (A-specific), PhyM-ribonuclease U specific and ribonuclease A (U/C specific). Other ef- 
ficient and base-specific ribonucleases can be Identified using the assay described in Example 21 . Preferably modified 
nucleotides are included in the transcription reaction with unmodified nucleotides. Most preferably, the modified nucle- 
otides and unmodified nucleotides are added to the transcription reaction at appropriate concentrations, so that both 
moieties are incorporated at a preferential rate of about t :1 . Alternatively, two separate transcriptions of the target DNA 
sequence one with the modified and one with the unmodified nucleotides can be performed and the results compared. 
Preferred modified nucleotides include: boron or bromine modified nucleotides (Porter etaL (1995) Biochemistry 34 : 
1 1 963-1 1 969; Hasan et aL (1 996) Nucl. Acids Res. 24:21 50-2157; LI et al. (1 995) Nucleic Acids Res. 23:4495-4501 ), 
a-thio- modified nucleotides, as well as mass-modified nucleotides as described above. 

[0239] Another method for generating base specifically terminated fragments involves performing a combined am- 
plification and base-specif ic termination reaction. For example, a combined amplification and termination reaction can 
be performed using at least two different polymerase enzymes, each having a different affinity for the chain terminating 
nucleotide, so that polymerization by an enzyme with relatively low affinity for the chain terminating nucleotide leads 
to exponential amplification whereas an enzyme with relatively high affinity for the chain terminating nucleotide termi- 
nates the polymerization and yields sequencing products. 

[0240] The combined amplification and sequencing can be based on any amplification procedure that employs an 
enzyme with polynucleotide synthetic ability (e.g. , polymerase). One preferred process, based on the polymerase chain 
reaction (PCR), includes the following three thermal steps: 1) denaturing a double stranded (ds) DNA molecule at an 
appropriate temperature and for an appropriate period of time to obtain the two single stranded (ss) DNA molecules 
(the template: sense and antisense strand); 2) contacting the template with at (east one primer that hybridizes to at 
least one ss DNA template at an appropriate temperature and for an appropriate period of time to obtain a primer 
containing ss DNA template; 3) contacting the primer containing template at an appropriate temperature and for an 
appropriate period of time with: (i) a complete set of chain elongating nucleotides, (ii) at least one chain terminating 
nucleotide, (iii) a first DNA polymerase, which has a relatively low affinity towards the chain terminating nucleotide; 
and (iv) a second DNA polymerase, which has a relatively high affinity towards the chain terminating nucleotide. 
[0241] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired 
amount of amplified sequencing ladders. The quantity of the base specifically terminated fragment desired dictates 
how many cycles are performed. Although an increased number of cycles results in an increased level of amplification, 
it may also detract from the sensitivity of a subsequent detection. It is therefore generally undesirable to perform more 
than about 50 cycles, and is more preferable to perform less than about 40 cycles (e.g. , about 20-30 cycles). 
[0242] Another preferred process for simultaneously amplifying and chain terminating a nucleic acid sequence is 
based on strand displacement amplification (SDA) (see, o^ t Walker etaL (1994) Nud. Acids Res. 22 :2670-77; Euro- 
pean Patent Publication Number 0 684 315 entitled "Strand Displacement Amplification Using Thermophilic Enzymes"). 
In essence, this process involves the following three steps, which altogether constitute a cycle: 1) denaturing a double 
stranded (ds) DNA molecule containing the sequence to be amplified at an appropriate temperature and for an appro- 
priate period of time to obtain the two single stranded (ss) DNA molecules (the template: sense and antisense strand); 
2) contacting the template with at least one primer (P), that contains a recognition/cleavage site for a restriction endo- 
nuclease (RE) and that hybridizes to at least one ss DNA template at an appropriate temperature and for an appropriate 
period of time to obtain a primer containing ss DNA template; 3) contacting the primer containing template at an ap- 
propriate temperature and for an appropriate period of time with (i) a complete set of chain elongating nucleotides; (ii) 



28 



BP 1 164 203 A2 



at least one chain terminating nucleotide; (lii) a first ON A polymerase, which has a relatively low affinity towards the 
chain terminating nucleotide; <iv) a second DNA polymerase* which has a relatively high affinity towards the chain 
terminating nucleotide; and (v) an RE that nicks the primer recognition/cleavage site. 

[0243] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired 
amount of amplified sequencing ladders. As with the PCR based process, the quantity of the base specifically termi- 
nated fragment desired dictates how many cycles are performed. Preferably, less than 50 cycles, more preferably less 
than about 40 cycles and most preferably about 20 to 30 cycles are performed. 

[0244] Preferably about 0.5 to about 3 units of polymerase is used in the combined amplification andchain termination 
reaction. Most preferably about 1 to 2 units is used. Particularly preferred polymerases for use In conjunction with PCR 
or other thermal amplification process are thermostable polymerases, such as Taq DNA polymerase (Boehringer Man- 
nheim), AmpliTaq FS DNA polymerase (Perkin- Elmer). Deep Vent (exo), Vent, Vent (exo-) and Deep Vent DNA 
polymerases (New England Bio labs), Thermo Sequenase (Amersham) or exo(-) Pseudococcus furiosus (Pfu) DNA 
polymerase (Stratagene, Heidelberg, Germany). AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, and Pyrococcus furi- 
osus. In addition, preferably the polymerase does not have 5-3' exonuclease activity. 

[0245] In addition to polymerases, which have a relatively high and a relatively low affinity to the chain terminating 
nucleotide, a third polymerase, which has proofreading capacity (e.g. , Pyrococcus woesef (Pwo)) DNA polymerase 
may also be added to the amplification mixture to enhance the fidelity of amplification. 

[0246] Yet another method for generating base specifically terminated fragments involves contacting an appropriate 
amount of the target nucleic acid with a specific endonuclease or exonuclease. Preferably, the original 5' and/or 3' end 
of the nucleic acid is tagged to facilitate the ordering of fragments. Tagging of the 3' end is particularly preferred when 
in vitro nucleic acid transcripts are being analyzed, so that the influence of 3' heterogeneity, premature termination and 
nonspecific elongation can be minimized. 5' and 3' tags can be natural (e.g. , a 3* poly A tail or 5* or 3' heterogeneity) 
or artificial. Preferred 5" and/or 3' tags are selected from among the molecules described for mass-modification above. 
[0247] The methods provided herein are further Illustrated by the following examples, which should not be construed 
as limiting in any way. 

EXAMPLE 1 

MALDI-TOF desorption of oligonucleotides directly on solid supports 

[0248] 1 g CPG (Controlled Pore Glass) was functionaiized with 3-(triethoxysilyi)-epoxypropan to form OH-groups 
on the polymer surface. A standard oligonucleotide synthesis with 1 3 mg of the OH-CPG on a DNA synthesizer (Milligen, 
Model 7500) employing P-cyanoethyl-phosphoamidites (Koster et aL (1 994) Nucleic Acids Res. 12:4539) and TAC N- 
protecting groups (Koster etaL (1981) Tetrahedron 37:362) was performed to synthesize a 3'-T 5 -50mer oligonucleotide 
sequence in which 50 nucleotides are complementary to a "hypothetical" 50mer sequence. T 5 serves as a spacer. 
Deprotection with saturated ammonia in methanol at room temperature for 2 hours furnished according to the deter- 
mination of the DMT group CPG which contained about 10 umol 56mer/g CPG. This 55mer served as a template for 
hybridizations with a 26-mer (with 5'-DMT group) and a 40-mer (without DMT group). The reaction volume Is 100 u.l 
and contains about 1 nmol CPG bound 55mer as template, an equimolar amount of oligonucleotide in solution (26-mer 
or 40-mer) in 20 mM Tris-HCI, pH 7.5, 1 0 mM MgCfe and 25 mM NaCf. The mixture was heated for 10 mln at 65*C 
and cooled to 37*C during 30' (annealing). The oligonucleotide which has not been hybridized to the polymer-bound 
template were removed by centrifugation and three subsequent washing/centrifugation steps with 1 00 ul each of ice- 
cold 50 mM ammoniumcitrate. The beads were alr-drled and mixed with matrix solution (3-hydroxypJcolinic acid/1 OmM 
ammonium citrate in acetonitrile/water, 1:1), and analyzed by MALDI-TOF mass spectrometry. The results are pre- 
sented in Figures 1 0 and 11 . 

EXAMPLE 2 

Electrospray (ES) desorption and differentiation of an 18-mer and 19-mer 

[0249] DNA fragments at a concentration of 50 pmole/ul in 2-propanof/1 OmM ammonlumcarbonate (1/9, v/v) were 
analyzed simultaneously by an electrospray mass spectrometer. 

[0250] The successful desorption and differentiation of an 1 8-mer and 1 9-mer by electrospray mass spectrometry 
is shown in FIGURE 12. 
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EXAMPLE 3 

Detection of The Cystic Fibrosis Mutation AF508, by single step dldeoxy extension and analysis by MALDh 
TOP mass spectrometry (Competitive Oligonucleotide Simple Base Extension = COS BE) 

5 

[0251 1 The principle of the COSBE method is shown in FIGU RE 1 3, N being the normal and M the mutation detection 
primer, respectively. 

MATERIALS AND METHODS 

[0252] PCR Amplification and Strand Immobilization. Amplification was carried out with exon 10 specific primers 
using standard PCR conditions (30 cycles: r@95 0 C, V@5S°C, 2'@72°C); the reverse primer was 5' labelled with 
blotin and column purified (Oligopurification Cartridge, Cruachem). After amplification the amplified products were 
purified by column separation (Qiagen Quickspin) and immobilized on streptavidin coated magnetic beads (Dynabeads, 
is Dynal, Norway) according to their standard protocol; DNA was denatured using 0.1 M NaOH and washed with 0.1M 
NaOH, 1xB+W buffer and TE buffer to remove the non-biotinylated sense strand. 

[0253] COSBE Conditions. The beads containing ligated antisense strand were resuspended In 18uJ of Reaction 
mix 1 (2 uJ 10X Taq buffer, 1 u.L (1 unit) Taq Polymerase, 2 uL of 2 mM dGTP, and 13 u.L H 2 0) and incubated at 80*C 
for 5* before the addition of Reaction mix 2 (1 00 ng each of COSBE primers). The temperature was reduced to 60°C 
20 and the mixtures incubated for a 5 1 annealing/extension period; the beads were then washed in 25mM triethylammonium 
acetate (TEAA) followed by 50mM ammonium citrate. 

[0254] Primer Sequences. All primers were synthesized on a Perseptive Biosystems Expedite 8900 DNA Synthesizer 
using conventional phosphoramidite chemistry (Sinha et al. (1 984) Nucleic Acids Res. 12 :4539). COSBE primers (each 
containing an intentional mismatch one base before the 3'-terminus) were those used In a previous ARMS study (Feme 
25 ®I §L (1992) Am J Hum Genet 51:251*262) with the exception that two bases were removed from the 5' -end of the 
normal: 

Ex1 0 PCR (Forward): 5'-BIO-GCA AGT GAA TCC TGA GCG TG-3* {SEQ ID No. 1) 
Ex10 PCR (Reverse): 5'-GTG TGA AGG GTT CAT ATG C-3' (SEQ ID No. 2) 
30 COSBE AF508-N 5*-ATC TAT ATT CAT CAT AGG AAA CAC CAC A-3' (28-mer) (SEQ ID No. 3) 

COSBE AF508-N 5'-GTA TCT ATA TTC ATC ATA GGA AAC ACC ATT-3* (30-mer) (SEQ ID No. 4) 

[0255] Mass Spectrometry. After washing, beads were resuspended In 1 u.L 1 8 Mohm/cm H 2 0 . 300 nL each of matrix 
(Wu et al. (1993) Rapid Commun. Mass Spectrom. 7:142-146) solution (0.7 M 3-hydroxypicolinic acid, 0.7 M dibasic 

55 ammonium citrate in 1 :1 h^OrCh^CN) and resuspended beads (Tang et al. (1 995) Rapid Commun Mass Spectrom B : 
727-730) were mixed on a sample target and allowed to air dry. Up to 20 samples were spotted on a probe target disk 
for introduction into the source region of an unmodified Thermo Bioanalysis (formerly Finnigan) Visions 2000 MALDI- 
TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical 
average molecular weights (M r (calc)) were calculated from atomic compositions. Vendor provided software was used 

-«? to determine peak centroids using external calibration ; 1 .08 Da has been subtracted from these to correct for the charge 
carrying proton mass to yield the text M f (exp) values. 

[0256] Scheme. Upon annealing to the bound template, the N and M primers (8508.6 and 9148.0 Da, respectively) 
are presented with dGTP; only primers with proper Watson-Crick base paring at the variable (V) position are extended 
by the polymerase. Thus if V pairs with the 3'-terminal base of N, N is extended to a 8837.9 Da product (N+ 1). Likewise, 
« if V Is property matched to the M terminus, M is extended to a 9477.3 Da M + product. 

Results 

[0257] Figures 14-18 show the representative mass spectra of COSBE reaction products. Better results were ob- 
50 tained when amplified products were purified before the biotlnylated anti-sense strand was bound. 

EXAMPLE 4 

Differentiation of Human Apo lipoprotein E Isoforms by Mass Spectrometry 

55 

[0258] Apo lipoprotein E (Apo E), a protein component of lipoproteins, plays an essential role in lipid metabolism. For 
example, it is involved with cholesterol transport, metabolism of lipoprotein particles, immunoregulation and activation 
of a number of lipolytic enzymes. 
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[0259] There are three common isoforms of human Apo E (coded by E2, E3 and E4 alleles). The most common is 
the E3 allele. The E2 allele has been shown to decrease the cholesterol level in plasma and therefore may have a 
protective effect against the development of atherosclerosis. The DNA encoding a portion of the E2 allele is set forth 
in SEQ ID No. 130. Finally, the E4 isoform has been correlated with increased levels of cholesterol, conferring predis- 
position to atherosclerosis. Therefore, the identity of the apo E allele of a particular individual is an important determinant 
of risk for the development of cardiovascular disease. 

[0260] As shown in Figure 1 9, a sample of DNA encoding apolipoprotein E can be obtained from a subject, amplified 
( e g , via PCR); and the amplified product can be digested using an appropriate enzyme (e.g. , Cfol). The restriction 
digest obtained can then be analyzed by a variety of means. As shown in Figure 20, the three isotypes of apolipoprotein 
E (E2, E3 and E4 have different nucleic acid sequences and therefore also have distinguishable molecular weight 
values. 

[0261] As shown in Figure 21 A-C, different Apolipoprotein E genotypes exhibit different restriction patterns in a 3.5% 
MetPhor Agarose Gel or 12% polyacryiamlde gel. As shown in Figures 22 and 23, the vahous apolipoprotein E geno- 
types can also be accurately and rapidly determined by mass spectrometry. 

EXAMPLE 5 

Detection of hepatitis B virus in serum samples. 
MATERIALS AND METHODS 

Sample preparation 

[0262] Phenol/choloform extraction of viral DNA and the final ethanol precipitation was done according to standard 
protocols. 

First PCR 

[0263] Each reaction was performed with 5u.l of the DNA preparation from serum. 1 5 pmol of each primer and 2 units 
Taq DNA polymerase (Perkin Elmer, Weiterstadt, Germany) were used. The final concentration of each dIMTP was 
200U.MM, the final volume of the reaction was 50 uJ. 10X PCR buffer (Perkin Efrner, Weiterstadt, Germany) contained 
100 mM Tris-HCI, pH 8.3, 500 mM KCI, 15 mM MgCfe, 0.01% gelatine (w/v). Primer sequences: 

Primer SEQUENCE SEQ ID No. 

1 5'-GCTTTGGGGCATGGACATTGACCCGTATAA-3' 5 

2 5'-CTGACTACTAATTCCCTGGATGCTGGGTCT-3' 6 

Nested PCR: 

[0264] Each reaction was performed either with 1 u.l of the first reaction or with a 1 :10 dilution of the first PCR as 
template, respectively. 100 pmol of each primer, 2.5 u P/u(exo~) DNA polymerase (Stratagene, Heidetoerg, Germany), 
a final concentration of 200 uA/l of each dNTPs and 5 u.1 10x Pfu buffer (200 mM Tris-HCI, pH 8.75, 100 mM KCI, 100 
mM (NH^gSC^, 1% Triton X-100, 1 mg/ml BSA, (Stratagene, Heidelberg, Germany) were used in a final volume 50 
jU. The reactions were performed in a thermocycfer (OmniGene, MWG-Biotech, Ebersberg, Germany) using the fol- 
lowing program: 92°C for 1 minute, 60°C for 1 minute and 72° C for 1 minute with 20 cycles. Sequence of oligodeox- 
ynucleotides (purchased HPLC-purtfled from MWG-Biotech, Ebersberg, Germany): 

HBV13: 5 * -TTG C CTG AGTGC AGT ATG GT- 3' (SEQ ID NO. 7) 

HBV15bio: Biotin-5'-AGCTCTATATCGGGAAGCCT-3 r (SEQ 10 NO. 8) 
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Purification of amplified products: 

[0265] For the recording of each spectrum, one PCR, 50 ni r (performed as described above) was used. Purification 
was done according to the following procedure: Ultrafiltration was done using Ultrafree-MC filtration units (Millipore, 
Eschbom, Germany) according to the protocol of the provider with centrifugation at 8000 rpm for 20 minutes. 25uJ 
(1 0ng/uJ) streptavidin Dynabeads (Dynal, Hamburg, Germany) were prepared according to the instructions of the man- 
ufacturer and resuspended in 25uJ of B/W buffer (10 mM Tris-HCi, pH 7.5, 1mM EDTA, 2 M NaCI). This suspension 
was added to the PCR samples still in the filtration unit and the mixture was incubated with gentle shaking for 15 
minutes at ambient temperature. The suspension was transferred in a 1 .5 ml Eppendorf tube and the supernatant was 
removed with the aid of a Magnetic Particle Collector, MPC, (Dynal, Hamburg, Germany). The beads were washed 
twice with 50 uJ of 0.7 M ammonium citrate solution, pH 8.0 (the supernatant was removed each time using the MPC). 
Cleavage from the beads can be accomplished by using formamide at 90*C. The supernatant was dried in a speedvac 
for about an hour and resuspended In 4 uJ of ultrapure water (MilliQ UF plus Millipore, Eschborn, Germany). This 
preparation was used for MALDI-TOF MS analysis. 

MALDI-TOF MS: 

[0266] Half a microliter of the sample was pipetted onto the sample holder, then immediately mixed with 0.5 uJ matrix 
solution (0.7 M3-hydroxypicolinic acid 50% acetonitriie, 70 mM ammonium citrate). This mixture was dried at ambient 
temperature and introduced into the mass spectrometer. All spectra were taken in positive ion mode using a Finnigan 
MAT Vision 2000 (Finnigan MAT, Bremen, Germany), equipped with a refiectron (5 kev* ion source, 20 keV postaccel- 
eration) and a 337 nm nitrogen laser. Caltoratlon was done with a mixture of a 40-mer and a 100-mer. Each sample 
was measured with different laser energies. In the negative samples, the amplified product was detected neither with 
less nor with higher laser energies. In the positive samples the amplified product was detected at different places of 
the sample spot and also with varying laser energies. 

RESULTS 

[0267] A nested PCR system was used for the detection of HBV DNA in blood samples employing oligonucleotides 
complementary to the c region of the HBV genome (primer 1 : beginning at map position 1 763, primer 2 beginning at 
map position 2032 of the complementary strand) encoding the HBV core antigen (HBVcAg). DNA was isolated from 
patients serum according to standard protocols. A first PCR was performed with the DNA from these preparations 
using a first set of primers. If HBV DNA was present in the sample a DNA fragment of 269 bp was generated. 
[0268] In the second reaction, primers which were complementary to a region within the PCR fragment generated 
in the first PCR were used. If HBV related amplified products were present in the first PCR a DNA fragment of 67 bp 
was generated (see Fig. 25A) in this nested PCR. The usage of a nested PCR system for detection provides a high 
sensitivity and also serves as a specificity control for the external PCR (Rolfs et al. (1992) PCR: Clinical Diagnostics 
and Research, Springer, Heidelberg). A further advantage is that the amount of fragments generated in the second 
PCR is high enough to ensure an unproblematic detection although purification losses can not be avoided. 
[0269] The samples were purified using ultrafiltration to restreptavidin Dynabeads. This purification was done be- 
cause the shorter primer fragments were immobilized in higher yield on the beads due to stearic reasons. The immo- 
bilization was done directly on the ultrafiltration membrane to avoid substance losses due to unspecific absorption on 
the membrane. Following immobilization, the beads were washed with ammonium citrate to perform cation exchange 
(Pieles et al (1 993) Nucl. Acids Res. 2T31 91 -31 96). The immobilized DNA was cleaved from the beads using 25% 
ammonia which allows cleavage of DNA from the beads in a very short time, but does not result in an Introduction of 
sodium or other cations. 

[0270] The nested PCRs and the MALDI TOF analysis were performed without knowing the results of serological 
analysis. Due to the unknown virus titer, each sample of the first PCR was used undiluted as template and In a 1 :10 
dilution, respectively. 

[0271] Sample 1 was collected from a patient with chronic active HBV infection who was positive in Hbs- and Hbe- 
antlgen tests but negative in a dot blot analysis. Sample 2 was a serum sample from a patient with an active HBV 
infection and a massive vlremia who was HBV positive In a dot blot analysis. Sample 3 was a denatured serum sample 
therefore no serological analysis could be performed by an Increased level of transaminases indicating liver disease 
was detected. In autoradiograph analysis (Figure 24), the first PCR of this sample was negative. Nevertheless, there 
was some evidence of HBV infection. This sample is of interest for MALDI-TOF analysis, because it demonstrates that 
even low-level amounts of amplified products can be detected after the purification procedure. Sample 4 was from a 
patient who was cured of HBV infection. Samples 5 and 6 were collected from patients with a chronic active HBV 
infection. 
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[0272] Figure 24 shows the results of a PAGE analysis of the nested PCR reaction. A amplified product is clearly 
revealed in samples 1, 2, 3 t 5 and 6. In sample 4 no amplified product was generated, it is indeed HBV negative, 
according to the serological analysis. Negative and positive controls are indicated by + and -, respectively. Amplif ication 
artifacts are visible in lanes 2, 5, 6 and + if non-diluted template was used. These artifacts were not generated if the 
template was used in a 1:10 dilution. In sample 3, amplified product was merely detectable if the template was not 
diluted. The results of PAGE analysis are In agreement with the data obtained by serological analysis except for sample 
3 as discussed above. 

[0273] Figure 25A shows a mass spectrum of a nested amplified product from sample number 1 generated and 
purified as described above. The signal at 20754 Da represents the single stranded amplified product (calculated: 
20735 Da, as the average mass of both strands of the amplified product cleaved from the beads). The mass difference 
of calculated and obtained mass is 19 Da (0.09%). As shown In Fig. 25A, sample number 1 generated a high amount 
of amplified product, resulting in an unambiguous detection. 

[0274] Fig. 25B shows a spectrum obtained from sample number 3. As depicted in Fig. 24, the amount of amplified 
product generated in this section is significantly lower than that from sample number 1 . Nevertheless, the amplified 
product is clearly revealed with a mass of 20751 Da (calculated 20735). The mass difference is 16 Da (0.0B%). The 
spectrum depicted in Fig. 25C was obtained from sample number 4 which Is HBV negative (as is also shown in Fig 
24). As expected no signals corresponding to the amplified product could be detected. All samples shown in Fig. 25 
were analyzed with MALDl-TOF MS, whereby amplified product was detected in all HBV positive samples, but not in 
the HBV negative samples. These results were reproduced in several independent experiments. 

EXAMPLE 6 

Analysis of Ligase Chain Reaction Products Via MALDl-TOF Mass Spectrometry 
MATERIALS AND METHODS 

Oligodeoxynudeotides 

[0275] Except the biotinylated one and all other oligonucleotides were synthesized in a 0.2 pmol scale on a M illiGen 
7500 DNA Synthesizer (Miflipore, Bedford, MA, USA) using the p-cyanoethylphosphoamtdite method (Slnha, N.D. et 
aj. (1984) Nucleic Acids Res. 12:4539-4577). The ollgodeoxynucleotides were RP-HPLC-purified and deprotected 
according to standard protocols. The biotinylated oligodeoxynucleotide was purchased (HPLC-purified) from Biometra, 
Gottingen, Germany). Sequences and calculated masses of the oligonucleotides used: 



Oligodeoxy- SEQUENCE SEQ ID 

nucleotide No. 

A 5'-p-TTGTGCCACGCGGTTGGGAATGTA (7521 Da) 9 

B 5'-p-AGCAACGACTGTTTGCCCGCCAGTTG (7948 Da) 1 0 

C 5'-bio-TACATTCCCAACCGCGTGGCACAAC (7960 Da) 1 1 

O 5-p-AACTGGCGGGCAAACAGTCGTTGCT (7708 Da) 12 



5-Phosphorytation of oligonucleotides A and D 

[0276] This was performed with polynucleotide kinase (Boehringer, Mannheim, Germany) according to published 
procedures, the 5'-phosphoryJated oligonucleotides were used unpurified for LCR. 

Ligase chain reaction 

[0277] The LCR was performed with Pfu DNA ligase and a ligase chain reaction kit (Stratagene, Heidelberg, Ger- 
many) containing two different pBluescript Kll phagemids. One carrying the wildtype form of the E.colilac\ gene and 
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the other one a mutant of this gene with a single point mutation at bp 191 of the tad gene. 

[0278] The following LCR conditions were used for each reaction: 100 pg template DNA (0.74 fmol) with 500 pg 
sonified salmon sperm DNA as carrier, 25 ng (3.3 pmol) of each 5'-phosphoryIated oligonucleotide, 20 ng (2.5 pmol) 
of each non-phosphory fated oligonucleotide, 4 U Pfu DNA ligase in a final volume of 20 |xt buffered ss 50-mer was 
used (I fmol) as template, in this case oligo C was also biotinylated. All reactions were performed In a thermocycler 
(OmnlGene, MWG-Biotech, Ebersberg, Germany) with the following program: 4 minutes 92°C, 2 minutes 60°C and 
25 cycles of 20 seconds 92°C, 40 seconds 60°C. Except for HPLC analysis the biotinylated ligation educt C was used. 
In a control experiment the biotinylated and non-biotinytated oligonucleotides revealed the same gel etectrophoretlc 
results. The reactions were analyzed on 7.5% polyacrylamide gels. Ligation product 1 (oligo A and B) calculated mass: 
15450 Da, ligation product 2 (oligo C and D) calculated mass: 15387 Da. 

SMART-HPLC 

[0279] Ion exchange H PLC (IE HPLC) was performed on the SM A RTF-system (Pharmacia, Freiburg, Germany) using 
a Pharmacia Mono Q, PC 1.6/5 column. Eluents were buffer A (25 mM Trie-HCI, 1 mM EDTA and 0.3 M NaCI at pH 
8.0) and buffer B (same as A, but 1 M NaCI). Starting with 1 00% A for 5 minutes at a flow rate of 50jil/min. a gradient 
was applied from 0 to 70% B in 30 minutes, then increased to 100% B In 2 minutes and held at 100% B for 5 minutes. 
Two pooled LCR volumes (40\xf) performed with either wildtype or mutant template were injected. 

Sample preparation for MALDI-TOF-MS 

[0280] Preparation of immobilized DNA: For the recording of each spectrum two LCRs (performed as described 
above) were pooled and diluted 1:1 with 2x B/W buffer (10 mM Tris-HCI, pH 7.5, 1 mM EDTA, 2 M NaCI). To the 
samples 5 uJ streptavidin DynaBeads (Dynal, Hamburg, Germany) were added, the mixture was allowed to bind with 
gentle shaking for 15 minutes at ambient temperature. The supernatant was removed using a Magnetic Particle Col- 
lector, MPC, (Dynal, Hamburg, Germany) and the beads were washed twice with 50 u.1 of 0.7 M ammonium citrate 
solution (pH 8.0) (the supernatant was removed each time using the MPC). The beads were resuspended in 1 uJ of 
ultrapure water (MilliQ, Millipore, Bedford, Mabelow). 

[0281 ] Combination of ultrafiltration and streptavidin DynaBeads: For the recording of spectrum two LCRs (performed 
as described above) were pooled, diluted 1:1 with 2x B/W buffer and concentrated with a 5000 NMWL Ultrafree-MC 
filter unit (Millipore, Eschborn, Germany) according to the instructions of the manufacturer. After concentration the 
samples were washed with 300 |xl 1x B/W buffer to streptavidin DynaBeads were added. The beads were washed once 
on the Ultrafree-MC filtration unit with 300 uJ of 1x B/W buffer and processed as described above. The beads were 
resuspended in 30 to 50 u,l of 1x B/W buffer and transferred in a 1 .5 ml Eppendorf tube. The supernatant was removed 
and the beads were washed twice with 50 uJ of 0.7 M ammonium citrate (pH 8.0). Finally, the beads were washed once 
with 30 of acetone and resuspended in 1 uJ of ultrapure water. The ligation mixture after immobilization on the beads 
was used for MALDS-TOF-MS analysis as described below. 

MALDt- TOF-MS 

[0282] A suspension of streptavidin -coated magnetic beads with the immobilized DNA was pipetted onto the sample 
holder, then immediately mixed with 0.5 u.1 matrix solution (0.7 M 3-hydroxypico!inic acid in 50% acetonitrile. 70 mM 
ammonium citrate). This mixture was dried at ambient temperature and introduced into the mass spectrometer. All 
spectra were taken In positive ion mode using a Finnlgan MAT Vision 2000 (Finnigan MAT, Bremen, Gemnany), 
equipped with a ref lectron (5 keV ion source, 20 keV postacceleratlon) and a nitrogen laser (337 nm). For the analysis 
of Pfu DNA ligase 0.5 u.1 of the solution was mixed on the sample holder with 1 uJ of matrix solution and prepared as 
described above. For the analysis of unpurified LCRs 1 \i\ of an LCR was mixed with 1 uJ matrix solution. 

RESULTS 

[0283] The E. colt tad gene served as a simple model system to investigate the suitability of MALDI-TOF-MS as 
detection method for products generated in ligase chain reactions. This template system contains of an E. cofi tad 
wildtype gene in a pBIuescrtpt Kll phagemld and an E. cofi tad gene carrying a single point mutation at bp 1 91 (C to 
T transition; SEQ ID No. 131) in the same phagemld. Four different oligonucleotides were used, which were ligated 
only if the E cofi tad wildtype gene was present (Figure 26). 

[0284] LCR conditions were optimized using Pfu DNA ligase to obtain at least 1 pmol ligation product in each positive 
reaction. The ligation reactions were analyzed by polyacrylamide gel electrophoresis (PAGE) and HPLC on the SMART 
system (Figures 27, 28 and 29). Figure 27 shows a PAGE of a positive LCR with wildtype template (lane 1 ), a negative 
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LCR with mutant template (1 and 2) and a negative control which contains enzyme, oligonucleotides and no template 
but salmon sperm DNA. The gel electrophoresis clearly shows that the ligation product (50 bp) was produced only in 
the reaction with wildtype template; whereas neither the template carrying the point mutation nor the control reaction 
with salmon sperm DNA generated amplification products. In Figure 28, HPLC was used to analyze two pooled LCRs 
5 with wildtype template performed under the same conditions. The ligation product was clearly revealed. Figure 29 
shows the results of a HPLC In which two pooled negative LCRs with mutant template were analyzed. These chroma- 
tograms confirm the data shown in Figure 27 and the results taken together clearly demonstrate, that the system 
generates ligation products In a significant amount only If the wildtype template is provided. 

[0285] Appropriate control runs were performed to determine retention times of the different compounds involved in 
10 the L CR experiments. These include the four oligonucleotides (A, B, C, and D), a synthetic ds 50-mer (with the same 
sequence as the ligation product), the wildtype template DNA, sonicated salmon sperm DNA and the Pfu DNA ligase 
in ligation buffer. 

[0286] In order to test which purification procedure should be used before a LCR reaction can be analyzed by MALDI- 
TOF-MS, aliquots of an unpurified LCR (Figure 30A) and aliquots of the enzyme stock solution (Figure 30B) were 
19 analyzed with MALDI-TOF-MS. It turned out that appropriate sample preparation is absolutely necessary since all 
signals In the unpurified LCR correspond to signals obtained in the MALDI-TOF-MS analysis of the Pfu DNA ligase. 
The calculated mass vaiues of oligo A and the ligation product are 7521 Da and 15450 Da, respectively. The data in 
Figure 30 show that the enzyme solution leads to mass signals which do interfere with the expected signals of the 
ligation educts and products and therefore makes an unambiguous signal assignment impossible. Furthermore, the 
spectra showed signals of the detergent Tween 20 being part of the enzyme storage buffer which influences the crys- 
tallization behavior of the analyte/matrix mixture in an unfavorable way. 

[0287] In one purification format streptavi din -coated magnetic beads were used. As was shown in a recent paper, 
the direct desorption of DNA immobilized by Watson-Crick base pairing to a complementary DNA fragment covalently 
bound to the beads Is possible and the non-biotinylated strand will be desorbed exclusively (Tang et aj. (1995) Nucleic 

25 Acids Res. 23:3126-3131). This approach in using Immobilized ds DNA ensures that only the norvbiotinylated strand 
will be desorbed. If non-immobilized ds DNA is analyzed both strands are desorbed (Tang et al. (1994) Rapid Comm. 
Mass Soectrom. 7 183-186) leading to broad signals depending on the mass difference of the two single strands. 
Therefore, employing this system for LCR only the non-ligated oligonucleotide A, with a calculated mass of 7521 Da, 
and the ligation product from oligo A and oligo B (calculated mass: 1 5450 Da) will be desorbed if oligo C is biotinylated 

30 at the 5'-end and immobilized on steptavt din-coated beads. This results In a simple and unambiguous Identification of 
the LCR educts and products. 

[0288] Figure 31 A shows a MALDI-TOF mass spectrum obtained from two pooled LCRs (performed as described 
above) purified on streptavidin Dyna Beads and desorbed directly from the beads showed that the purification method 
used was efficient (compared with Figure 30). A signal which represents the unlrgated oligo A and a signal which 

35 corresponds to the ligation product could be detected. The agreement between the calculated and the experimentally 
found mass vaiues is remarkable and allows an unambiguous peak assignment and accurate detection of the ligation 
product. In contrast, no ligation product but only oligo A could be detected in the spectrum obtained from two pooled 
LCRs with mutated template (Figure 3IB). The specificity and selectivity of the LCR conditions and the sensitivity of 
the MALDI-TOF detection is further demonstrated when performing the ligation reaction In the absence of a specific 

40 template. Figure 32 shows a spectrum obtained from two pooled LCRs In which only salmon sperm DNA was used as 
a negative control, only oligo A could be detected, as expected. 

[0289] While the results shown in Figure 31 A can be correlated to lane 1 of the gel in Figure 27, the spectrum shown 
in Figure 31 B is equivalent to lane 2 in Figure 27, and finally also the spectrum in Figure 32 corresponds to lane 3 In 
Figure 27. The results are In congruence with the HPLC analysis presented in Figures 28 and 29. While gel electro* 
45 phoresis (Figure 27) and HPLC (Figures 28 and 29) reveal either an excess or almost equal amounts of ligation product 
over ligation educts, the analysis by MALDI-TOF mass spectrometry produces a smaller signal for the ligation product 
(Figure 31 A). 

[0290] The lower intensity of the ligation product signal could be due to different desorption/ionization efficiencies 
between 24- and a 50-mer. Since the T m value of a duplex with 50 compared to 24 base pairs is significantly higher, 
more 24-mer could be desorbed. A reduction in signal intensity can also result from a higher degree of fragmentation 
in case of the longer oligonucleotides. 

[0291] Regardless of the purification with streptavidin DynaBeads, Figure 32 reveals traces of Tween20 in the region 
around 2000 Da. Substances with a viscous consistence, negatively influence the process of crystallization and there- 
fore can be detrimental to mass spectrometer analysis. Tween20 and also glycerol which are part of enzyme storage 
buffers therefore should be removed entirely prior to mass spectrometer analysis. For this reason an improved purifi- 
cation procedure which includes an additional ultrafiltration step prior to treatment with DynaBeads was investigated. 
Indeed, this sample purification resulted in a significant improvement of MALDI-TOF mass spectrometric performance. 
[0292] Figure 33 shows spectra obtained from two pooled positive (Fig. 33A) and negative (Fig. 33B) LCRs, respec- 
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lively. The positive reaction was performed with a chemically synthesized, single strand 50m er as template with a 
sequence equivalent to the ligation product of oligo C and D. Oligo C was 5'biotinylated. Therefore the template was 
not detected. As expected, only the ligation product of Oligo A and B (calculated mass 15450 Da) could be desorbed 
from the immobilized and ligated oligo C and D. This newly generated DNA fragment is represented by the mass signal 
of 1 5448 Da in Figure 33A. Compared to Figure 32A, this spectrum clearly shows that this method of sample preparation 
produces signals with improved resolution and Intensity. 

EXAMPLE 7 

Mutation detection by solid phase oligo base extension of a primer and analysis by MALDI-TOF mass 
spectrometry (Primer Oligo Base Extension = Probe) 

Summary 

[0293] The solid-phase oligo base extension method detects point mutations and small deletions as well as small 
insertions in amplified DNA. The method is based on the extension of a detection primer that anneals adjacent to a 
variable nucleotide position on an affinity-captured amplified template, using a DNA polymerase, a mixture of three 
dNTPs, and the missing one dideoxy nucleotide. The resulting products are evaluated and resolved by MALDI-TOF 
mass spectrometry without further labeling procedures. The aim of the following experiment was to determine mutant 
and wildtype alleles in a fast and reliable manner. 

Description of the experiment 

[0294] The method used a single detection primer followed by a oligonucleotide extension step to give products 
differing in length by some bases specific for mutant or wildtype alleles which can be easily resolved by MALDI-TOF 
mass spectrometry. The method is descrfeed by using as example the exon 10 of the CFTR-gene. Exon 10 of this 
gene bears the most common mutation in many ethnic groups (AF508) that leads in the homozygous state to the clinical 
phenotype of cystic fibrosis. 

MATERIALS AND METHODS 

Genomic DNA 

[0295] Genomic DNA were obtained from healthy individuals, individuals homozygous or heterozygous for the AF508 
mutation, and one individual heterozygous for the 1506S mutation. The wildtype and mutant alleles were confirmed 
by standard Sanger sequencing. 

PCR amplification of exon 10 of the CFTH gene 

[0296] The primers for PCR amplification were CFExlO-F (5-GCAAGTGAATCCTGAGCGTG-3* (SEQ ID No. 13) 
located in Intron 9 and biotinyJated) and CFExl 0-R (5-GTGTGAAGGGCGTG-3' SEQ ID No. 14) located in intron 10). 
Primers were used in a concentration of 8 prnol. Taq-polymerase including 1 0x buffer were purchased from Boehringer- 
Mannheim and oTNPs were obtained from Pharmacia. The total reaction volume was 50 ul. Cycling conditions for PCR 
were initially 5 min. at 95 0 C, followed by 1 min. at 94°C, 45 sec at 53°C, and 30 sec at 72°C for 40 cycles with a final 
extension time of 5 min at 72°C. 

Purification of the amplified products 

[0297] Amplification products were purified by using Qiagen's PCR purification kit (No. 28106) according to manu- 
facturer's instructions. The elutlon of the purified products from the column was done in 50 uJ TE-buffer (1 OmM Tris, 1 
mM EDTA, pH 7,5). 

Affinity-capture and denatu ration of the double stranded DNA 

[0298] 1 0 U.L aliquots of the purified amplified product were transferred to one well of a streptavidin-coated microliter 
plate (No. 1645684 Boehringer- Mannheim or No. 95029262 Labsystems). Subsequently, 10 p.! incubation buffer (80 
mM sodium phosphate, 400 mM NaCI, 0,4% Tween20, pH 7,5) and 30 u.l water were added. After Incubation for 1 hour 
at room temperature the welis were washed three times with 200uJ washing buffer (40 mM Tris, 1 mM EDTA, 50 mM 
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NaCI, 0.1% Tween 20, pH 8.8). To denature the double stranded DNA the wells were treated with 100 u.l of a 50 mM 
NaOH solution for 3 min and the wells washed three times with 200 u.l washing buffer. 

Oligo base extension reaction 

[0299] The annealing of 25 pmol detection primer <CF508: S'-CTATATTCATCATAGGAAACACCA-S' (SEQ ID No. 
15) was performed in 50 uJ annealing buffer (20 mM Trfs, 10 mM KCI, 10 mM (NH^SO^ 2 mM MgS02, 1% Triton X- 
100, pH 8) at 50°C for 10 mln. The wells were washed three times with 200 uJ washing buffer and once in 200 uJ TE 
buffer. The extension reaction was performed by using some components of the DNA sequencing kit from USB (No. 
70770) and dNTPs or ddNTPs from Pharmacia. The total reaction volume was 45 jxl f containing of 21 u.l water, 6 uJ 
Sequenase-buffer, 3 u.l 10 mM DTT solution, 4,5 u.1, 0,5 mM of three dNTPs, 4,5 u.1, 2 mM the missing one ddNTP, 5,5 
uJ glycerol enzyme dilution buffer, 0,25 uJ Sequenase 2.0, and 0,25 pyrophosphatase. The reaction was pipetted on 
ice and then incubated for 15 min at room temperature and for 5 min at 37°C. Hence, the wells were washed three 
times with 200 uJ washing buffer and once with 60 uJ of a 70 mM NH 4 -Citrate solution. 

Denaturation and precipitation of the extended primer 

[0300] The extended primer was denatured in 50 uJ 1 0%-DMSO (dimethylsuf oxide) in water at 80°C for 1 0 min. For 
precipitation, 10 uJ NH 4 -Acetate (pH 6.5), 0,5 uJ glycogen (10 mg/ml water, Sigma No. G1765), and 100 u,l absolute 
ethanol were added to the supernatant and incubated for 1 hour at room temperature. After centrifugation at 1 3.000 g 
for 1 0 min the pellet was washed in 70% ethanol and resuspended in 1 uJ 1 8 Mohm/cm H 2 0 water. 

Sample preparation and analysis on MALDI-TOF mass spectrometry 

[0301] Sample preparation was performed by mixing 0,3 uJ of each of matrix solution (0.7 M 3-hydroxypicolinic acid, 
0.07 M dibasic ammonium citrate in 1 :1 H2O.CH3CN) and of resuspended DNA/glycogen pellet on a sample target 
and allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction Into the source region of 
an unmodified Thermo Bioanalysis (formerly Finnigan) Visions 2000 MALDI-TOF operated in reflectron mode wtth 5 
and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (M^cate)) were 
calculated from atomic compositions; reported experimental Mr (M r (exp)) values are those of the singly-protonated 
form, determined using external calibration. 

RESULTS 

[0302] The aim of the experiment was to develop a fast and reliable method independent of exact stringencies for 
mutation detection that leads to high quality and high throughput in the diagnosis of genetic diseases. Therefore a 
special kind of DNA sequencing (oligo base extension of one mutation detection primer) was combined with the eval- 
uation of the resulting mini-sequencing products by matrix-assisted laser desorption ionization (MALDI) mass spec- 
trometry (MS). The tlme-of-flight (TOF) reflectron arrangement was chosen as a possible mass measurement system. 
To prove this hypothesis, the examination was performed with exon 10 of the CFTR-gene, in which some mutations 
could lead to the clinical phenotype of cystic fibrosis, the most common monogenetic disease In the Caucasian pop- 
ulation. 

[0303] The schematic presentation as given in Figure 34 shows the expected short sequencing products with the 
theoretically calculated molecular mass of the wildtype and various mutations of exon 1 0 of the CFTR-gene (SEQ ID 
No. 132). The short sequencing products were produced using either doTTP (Figure 34A; SEQ ID Nos. 133-135) or 
ddCTP (Figure 34B; SEQ ID Nos. 1 36-1 39) to Introduce a definitive sequence related stop in the nascent DNA strand. 
The MALDI-TOF-MS spectra of healthy, mutation heterozygous, and mutation homozygous individuals are presented 
in Figure 35. All samples were conf irmed by standard Sanger sequencing which showed no discrepancy in comparison 
to the mass spec anarysis. The accuracy of the experimental measurements of the various molecular masses was 
within a range of minus 21 .8 and plus 87.1 dalton (Da) to the range expected. This allows a definitive Interpretation of 
the results In each case. A further advantage of this procedure is the unambiguous detection of the AI507 mutation. 
In the doTTP reaction , the wildtype allele would be detected, whereas In the ddCTP reaction the three base pair deletion 
would be disclosed. 

[0304] The method described is highly suitable for the detection of single point mutations or mlcroleslons of DNA. 
Careful choice of the mutation detection primers will open the window of multiplexing and lead to a high throughput 
including high quality in genetic diagnosis without any need for exact stringencies necessary in comparable allele- 
specific procedures. Because of the uniqueness of the genetic information, the oligo base extension of mutation de- 
tection primer is applicable In each disease gene or polymorphic region in the genome like variable number of tandem 
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repeats (VNTR) or other single nucleotide polymorphisms (e.g. , apolipoprotein E gene), as also described here. 
EXAMPLE 8 

Detection of Polymerase Chain Reaction Products Containing 7-Deazapurlne Moieties with Matrix-Assisted 
Laser Desorptlon/lonlzatlon Tlme-of-Fllght (MALDI-TOF) Mass Spectrometry 

MATERIALS AND METHODS 

Nucleic add amplifications 

[0305] The following ollgodeoxynucleotlde primers were either synthesized according to standard phosphoamldite 
chemistry (Sinha, N.D,. et aL, (1 983) Tetrahedron Let Vol. 24, Pp. 5843-5846; Sinha, N.D., etaL, (19B4) Nucleic Adds 
Res.. Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthesizer (Millipore, Bedford, MA, USA) in 200 nmol scales 
or purchased from MWG-Biotech (Ebersberg, Germany, primer 3) and Biometra (Goettingen, Germany, primers 6-7). 



primer 1 : 
primer 2: 
primer 3: 
primer 4: 
primer 5: 
primer 6: 



5'-GTCACCCTCGACCTGCAG (SEQ ID NO. 16); 
5 ' -TTGT A A AA C G ACQ G CC AGT (SEQ ID NO. 17); 
5' -CTTCC ACCGCG ATGTTG A (SEQ ID NO. 18); 
5'CAGGAAACAGCTATGAC (SEQ ID NO. 19); 
5'-GTAAAACGACGGCCAGT (SEQ ID NO. 20); 
5'GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ ID NO. 21); 



primer 7: 5'-GTTGTAAAACGAGGGCCAgT {g: RiboG) (SEQ ID NO. 
22); 

[0306] The99-mer(SEQIDNo. 1 41 ) and 200-mer DNA strands (SEQ ID No. 1 40; modified and unmodified) as well 
as the rtoo- and 7-deaza-modified 1 00-mer were amplified from pRFd DNA (10 ng, generously supplied by S. Feyer- 
abend, University of Hamburg) in 1 00 uL reaction volume containing 1 0 mrnol/L KCI , 1 0 mmol/L (NH^C^, 20 mmoi/ 
L Tris HCI (pH 8.8), 2 mmol/L MgS0 4 , (exo(-) Pseudococcus furiosus (Pfu) -Buffer, Pharmacia, Freiburg, Germany), 
0.2 mmol/L each dNTP (Pharmacia, Freiburg, Germany), 1 umol/L of each primer and 1 unit of exo(-)PfvDNA polymer- 
ase (Stratagene, Heidelberg, Germany). For the 99-mer primers 1 and 2, for the 200-mer primers 1 and 3 and for the 
1 00-mer primers 6 and 7 were used. To obtain 7-deazapurine modified nucleic acids, during PCR-amplification dATP 
and dGTP were replaced with 7-deaza-dATP and 7-deaza- dGTP. The reaction was performed in a thermal cyder 
(OmniGene, MWG-Biotech, Ebersberg, Germany) using the cycfe: denaturation at 95°C for 1 min., annealing at 51 *C 
for 1 min. and extension at 72*C for 1 min. For all PCRs the number of reaction cycles was 30. The reaction was 
allowed to extend for additional 1 0 min. at 72°C after the last cycle. 

[0307] The 103-mer DNA strands (modified and unmodified; SEQ ID No. 245) were amplified from M13mp18 RFI 
DNA (100 ng, Pharmacia, Freiburg, Germany) in 1 00 uL reaction volume, using primers 4 and 5 all other concentrations 
were unchanged. The reaction was performed using the cycle: denaturation at 95°C for 1 min., annealing at 40°C for 
1 min. and extension at 72°C for 1 min. After 30 cycles for the unmodified and 40 cycles for the modified 103-mer 
respectively, the samples were Incubated for additional 10 min. at 72°C. 

Synthesis of S'-fB-Pl-iabeted PCR-primers 

[0308] Primers 1 and 4 were S'-p^P]- labeled employing T4-polynucIeotidklnase (Epicentre Technologies) and 
(y- 32 P)-ATP. (BLU/NGG/502A, Dupont, Germany) according to the protocols of the manufacturer. The reactions were 
performed substituting 10% of primer 1 and 4 in PCR with the labeled primers under otherwise unchanged reaction- 
conditions. The amplified DNAs were separated by gel electrophoresis on a 1 0% poJyacryfamJde gel. The appropriate 
bands were excised and counted on a Packard TRI-CARB 460C liquid scintillation system (Packard, CT t USA). 
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Primer-cleavage from ribo-modified PCR-product 

[0309] The amplified DNA was purified using Ultrafree-MC filter units (30,000 NMWL), it was then redissofved In 100 
u.l of 0.2 mol/L NaOH and heated at 95°C for 25 minutes. The solution was then acidified with HC1 (1 mol/L) and further 
purified for MALDI-TOF analysis employing Ultrafree-MC filter units (10,000 NMWL) as described below. 

Purification of amplified products 

[0310] All samples were purified and concentrated using Ultrafree-MC units 30000 NMWL (Millipore, Eschborn, Ger- 
many) according to the manufacturer's description. After lyophilization, amplified products were redissofved In 5 u.L (3 
u.L for the 200-mer) of uttrapure water. This analyte solution was directly used for MALDI-TOF measurements. 

MALDI-TOF MS 

[0311] Allquots of 0.5 u.L of analyte solution and 0.5 u.L of matrix solution (0.7 mol/L 3-HPA and 0.07 mol/L ammonium 
citrate in acetonftrile/water (1:1, v/v)) were mixed on a flat metallic sample support. Afterdrying at ambient temperature 
the sample was Introduced into the mass spectrometer for analysis. The MALDI-TOF mass spectrometer used was a 
Rnnigan MAT Vision 2000 (Finnlgan MAT, Bremen, Germany). Spectra were recorded in the positive ion reflector mode 
with a 5 keV ion source and 20 keV postacceleration. The Instrument was equipped with a nitrogen laser (337 nm 
wavelength). The vacuum of the system was 3-4*1 0* 8 hPa in the analyzer region and 1 -4*1 0" 7 hPa in the source region. 
Spectra of modified and unmodified DNA samples were obtained with the same relative laser power; external caltoration 
was performed with a mixture of synthetic oligodeoxynucleotkJes (7-to 50-mer). 

RESULTS AND DISCUSSION 

Enzymatic synthesis of 7-deazapurine nucleotide containing nucleic adds by PCR 

[0312] In order to demonstrate the feasibility of MALDI-TOF MS for the rapid, gel-free analysis of short amplified 
products and to investigate the effect of 7-deazapurine modification of nucleic acids under MALDI-TOF conditions, two 
different primer-template systems were used to synthesize DNA fragments. Sequences are displayed in Figures 36 
and 37. While the two single strands of the 103-mer amplified product had nearly equal masses (Am = 8 u), the two 
single strands of the 99-mer differed by 526 u. Considering that 7-deaza purine nucleotide building blocks for chemical 
DNA synthesis are approximately 1 60 times more expensive than regular ones (Product Information, Glen Research 
Corporation, Sterling, VA) and their application in standard p-cyano-phosphoamidite chemistry is not trivial (Product 
Information, Glen Research Corporation, Sterling, VA; Schneider et al. (1995) Nucl. Acids Res. 23 :1570) the cost of 
7-deaza purine modified primers would be very high. Therefore, to increase the applicability and scope of the method, 
all PCRs were performed using unmodified oligonucleotide primers which are routinely available. Substituting dATP 
and dGTP by c 7 -dATP and c 7 -dGTP in polymerase chain reaction led to products containing approximately 80% 7-dea- 
za-purine modified nucleosides for the 99-mer and 103-mer; and about 90% for the 200-mer, respectively. Table II 
shows the base composition of all PCR products. 



TABLE II: 



Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products {unmodified and 7-deaza 

purine modified) 


DNA-fragments 1 


C 


T 


A 


G 


c 7 -deaza-A 


c 7 -deaza-6 


rel. mod.2 


200-mers 


54 


34 


56 


56 








modified 200-mer 
s 


54 


34 


6 


5 


50 


51 


90% 


200-mer a 


56 


56 


34 


54 








modified 200-mer 
a 


56 


56 


3 


4 


31 


50 


92% 


103-mer s 


28 


23 


24 


28 









*' and 'a' describe "sense" and "antisense* strands of the double -stranded amplified product. 



Indicates relative modification as percentage of 7-deaza purine modified nucleotides of total amount of purine nucleotides. 
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TABLE II: (continued) 



Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products (unmodified and 7-deaza 

purine modified) 


DNA-fragmente 1 


C 


T 


A 


G 


c 7 -deaza-A 


c 7 -deaza-6 


rel. mod.2 


modified 103-mer 
s 


28 


23 


6 


5 


18 


23 


79% 


103-mer a 


28 


24 


23 


28 








modified 103-mer 
a 


28 


24 


7 


4 


16 


24 


78% 


99-mer s 


34 


21 


24 


20 








modified 99-mer s 


34 


21 


6. 


5 


18 


15 


75% 


99-mer a 


20 


24 


21 


34 








modified 99-mer a 


20 


24 


3 


4 


18 


30 


87% 



1 "s" and "a* describe "sense' and "antteense" strands of the double-stranded amplified product. 



Indteates relative modification as percentage of 7-deaza purine modified nucleotides of total amount of purine nucleotides. 

[0313] It remained to be determined whether 80-90% 7-deaza-purine modification is sufficient for accurate mass 
spectrometer detection. It was therefore important to determine whether all purine nucleotides could be substituted 
during the enzymatic amplification step. This was not trivial since it had been shown that c 7 -dATP cannot fully replace 
dATP in PCR if 7arj DNA polymerase is employed (Seela, F. and A. Roellfng (1992) Nucleic Acids Res.. 20,55-61). 
Fortunately it was found that exo(-)Pru DNA polymerase indeed could accept c 7 -dATP and c 7 -dGTP in the absence of 
unmodified purine nucleoside triphosphates. The incorporation was less efficient leading to a lower yield of amplified 
product (Figure 38). 

[0314] To verify these results, the amplications with [^PJ-labeled primers were repeated. The autoradlogram (Figure 
39) clearly shows lower yields for the modified PCR-products. The bands were excised from the gel and counted. For 
all amplified products the yield of the modified nucleic acids was about 50%, referring to the corresponding unmodified 
amplification product. Further experiments showed that exo(-)Deep Vent and Vent DNA polymerase were able to in- 
corporate c 7 -dATP and c 7 -dGTP during PCR as well. The overall performance, however, turned out to be best for the 
exo(-)Pfty DNA polymerase giving least side products during amplification. Using all three polymerases, it was found 
that such PCRs employing c 7 -dATP and c 7 -dGTP instead of their isosteres showed less side-reactions giving a cleaner 
PCR-product. Decreased occurrence of amplification side products may be explained by a reduction of primer mis- 
matches due to a ling template which Is synthesized during PCR, Decreased melting point for DNA duplexes containing 
7-deaza-purine have been described (Mizusawa, S. et aL, (I 986) Nucleic Acids Res., 14, 1319-1324). In addition to 
the three polymerases specified above (exo(-) Deep Vent DNA polymerase, 5Vent DNA polymerase and exo(-) (Pfu) 
DNA polymerase), it is anticipated that other polymerases, such as the Large Klenow fragment of E.coll DNA polymer- 
ase, Sequenase, Taq DNA polymerase and U AmpfiTaq DNA polymerase can be used. In addition, where RNA is the 
template, RNA polymerases, such as the SP6 or the T7 RNA polymerase, must be used. 

MALDi-TOF mass spectrometry of modified and unmodified amplified products. 

[0315] The 99-mer, 103-mer and 200-mer amplified products were analyzed by MALDI-TOF MS. Based on past 
experience, it was known that the degree of depurination depends on the laser energy used for desorptlon and ionization 
of the analyte. Since the influence of 7-deazapurine modification on fragmentation due to depurination was to be in- 
vestigated, all spectra were measured at the same relative laser energy. 

[0316] Figures 40a and 40b show the mass spectra of the modified and unmodified 103-mer nucleic acids. In case 
of the modified 103-mer, fragmentation causes a broad (M + H)* signal. The maximum of the peak is shifted to lower 
masses so that the assigned mass represents a mean value of (M + H) + signal and signals of fragmented ions, rather 
than the (M + H)+ signal itself. Although the modified 1 03-mer still contains about 20% A and G from the oligonucleotide 
primers, rt shows less fragmentation which is featured by much more narrow and symmetric signals. Especially peak 
tailing on the lower mass side due to depurination, is substantially reduced. Hence, the difference between measured 
and calculated mass is strongly reduced although it Is stiH below the expected mass. For the unmodified sample a 
(M+H)+ signal of 31 670 was observed, which is a 97 u or 0.3% difference to the calculated mass. While, in case of the 
modified sample this mass difference diminished to 10 u or 0.03% (31713 u found, 31723 u calculated). These obser- 
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vations are verified by a significant increase in mass resolution of the (M4-H)+ signal of the two signal strands (n/Am = 
67 as opposed to 18 for the unmodified sample with Am = full width at half maximum, fwhm). Because of the low mass 
difference between the two single strands (8 u) their individual signals were not resolved. 

[0317] With the results of the 99 base pair DNA fragments the effects of increased mass resolution for 7-deazapurine 
containing DNA becomes even more evident. The two single strands in the unmodified sample were not resolved even 
though the mass difference between the two strands of the amplified product was very high with 526 u due to unequal 
distribution of purines and pyrimidines (figure 41 a). In contrast to this, the modified DNA showed distinct peaks for the 
two single strands (figure 41b) which demonstrates the superiority of this approach for the determination of molecular 
weights to gel electrophoretic methods even more profound. Although base line resolution was not obtained the indi- 
vidual masses were able to be assigned with an accuracy of 0.1%: Am = 27 u for the lighter (calc. mass = 30224 u) 
and Am = 14 u for the heavier strand (calc. mass = 30750 u). Again, it was found that the full width at half maximum 
was substantially decreased for the 7-deazapurine containing sample. 

[0318] In case the 99-mer and 1 03«mer, the 7-deazapurine containing nucleic acids seem to give higher sensitivity 
despite the fact that they still contain about 20% unmodified purine nucleotides. To get comparable signal-to-noise 
ratio at similar intensities for the (M + H) + signals, the unmodified 99-mer required 20 laser shots In contrast to 12 for 
the modified one and the 103-mer required 1 2 shots for the unmodified sample as opposed to three for the 7-deaza- 
purine nucleoside-containing amplified product. 

[0319] Comparing the spectra of the modified and unmodified 200-mer amplicons, improved mass resolution was 
again found for the 7-deazapurine containing sample as well as increased signal intensities (Figures 42A and 42B). 
While the signal of the single strands predominates m the spectrum of the modified sample the DNA-duplex and dimers 
of the single strands gave the strongest signal for the unmodified sample. 

[0320] A complete 7-deaza purine modification of nucleic acids may be achieved either using modified primers in 
PCR or cleaving the unmodified primers from the partially modified amplified product. Since disadvantages are asso- 
ciated with modified primers, as described above, alOO-mer was synthesized using primers with a ribo-modification. 
The primers were cleaved hydrolytically with NaOH according to a method developed earlier in our laboratory (Koester, 
H. et aL, ZPhysiot. Chem., 359, 1570-1589). Figures 43A and 43B display the spectra of the amplified product before 
and after primer cleavage. Figure 43b shows that the hydrolysis was successful: The hydrolyzed amplified product as 
well as the two released primers could be detected together with a small signal from residual uncleaved 1 00-mer This 
procedure is especially useful for the MALDI-TOF analysis of very short PCR-products since the share of unmodified 
purines originating from the primer Increases with decreasing length of the amplified sequence. 
[0321] The remarkable properties of 7-deazapurine modified nucleic acids can be explained by either more effective 
desorption and/or ionization, increased ion stability and/or a lower denaturation energy of the double stranded purine 
modified nucleic acid. The exchange of the N-7 for a methyl group results in the loss of one acceptor for a hydrogen 
bond which influences the ability of the nucleic acid to form secondary structures due to non-Watson-Crfck base pairing 
(Seela, F. and A. Kehne (1 987) Biochemistry, 26, 2232-2238 ). In addition to this the aromatic system of 7-deazapurine 
has a lower electron density that weakens Watson -Crick base pairing resulting in a decreased melting point (Mizusawa, 
S. et aL, (1 986) Nucleic Acids Pes., 1 4, 1 31 9-1 324) of the double-strand. This effect may decrease the energy needed 
for denaturation of the duplex in the MALDI process. These aspects as well as the loss of a site which probably will 
carry a positive charge on the N-7 nitrogen renders the 7-deazapurine modified nucleic acid less polar and may promote 
the effectiveness of desorption. 

[0322] Because of the absence of N-7 as proton acceptor and the decreased polarization of the C-N bond in 7-dea- 
zapurine nucleosides depurinatlon following the mechanisms established for hydrolysis in solution is prevented. Al- 
though a direct correlation of reactions in solution and in the gas phase is problematic, less fragmentation due to 
depurination of the modified nucleic acids can be expected In the MALDI process. Depurinatlon may either be accom- 
panied by loss of charge which decreases the total yield of charged species or It may produce charged f ragmentation 
products which decreases the intensity of the non fragmented molecular Ion signal. 

[0323] The observation of increased sensitivity and decreased peak tailing of the (M+H)+ signals on the lower mass 
side due to decreased fragmentation of the 7-deazapurine containing samples indicate that the N-7 atom indeed is 
essential for the mechanism of depurination in the MALDI-TOF process. In conclusion, 7-deazapurine containing nu- 
cleic acids show distinctly Increased Ion -stability and sensitivity under MALDI-TOF conditions and therefore provide 
for higher mass accuracy and mass resolution. 
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EXAMPLE d 

Solid Phase Sequencing and Mass Spectrometer Detection 
MATERIALS AND METHODS 

[0324] Oligonucleotides were purchased from Operon Technologies (Alameda, CA) In an unpurifled form. Sequenc- 
ing reactions were performed on a solid surface using reagents from the sequencing kit for Sequenase Version 2.0 
(Amersham, Arlington Heights, Illinois). 

Sequencing a 39-mer target 

Sequencing complex: 

[0325] 



SEQUENCE SEQ ID 

NO. 

5'-TCTGGCCTGQTGCAGGGCCTATTGTAGTTGTGACGTACA-tA b ).-3' 23 
5'-TGTACGTCACAACT-3' <PNA 16/ONA) 24 

[0326] In order to perform solid-phase DN A sequencing, template strand DNA1 1 683 was 3*-biotinylated by terminal 
deoxynucleottdyl transferase. A 30 pi reaction, containing 60 pmol of DNA116B3, 1 .3 nmol of biotin 14-dATP (GIBCO 
BRL, Grand Island, NY), 30 units of terminal transferase (Amersham, Arlington Heights, Illinois), and 1x reaction buffer 
(supplied with enzyme), was incubated at 37°C for 1 hour. The reaction was stopped by heat Inactivatlon of the terminal 
transferase at 70°C for 1 0 min. The resulting product was desalted by passing through a TE-1 0 spin column (Clontech). 
More than one molecules of biotin-14-dATP could be added to the 3'-end of DNA11683. The biotinylated DNA11683 
was incubated with 0.3 mg of Dynal streptavldln beads in 30 u.l 1 x binding and washing buffer at ambient temperature 
for 30 min. The beads were washed twice with TE and redissolved in 30 u,l TE, 10 uJ aliquot (containing 0.1 mg of 
beads) was used for sequencing reactions. 

[0327) The 0.1 mg beads from previous step were resuspended In a 10 u.1 volume containing 2 u.1 of 5x Sequenase 
buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgCI 2 , and 250 mM NaCI) from the Sequenase kit and 5 pmol of corre- 
sponding primer PNA 1 67DNA. The annealing mixture was heated to 70 a C and allowed to cool slowly to room temper- 
ature over a 20-30 min time period. Then 1 u.10.1 M dithiolhreitol solution, 1 uJ Mn buffer (0.1 5 M sodium isocitrate and 
0.1 M MgCy, and 2 u.i of diluted Sequenase (3.25 units) were added. The reaction mixture was divided into fouraliquots 
of 3 u-l each and mixed with termination mixes (each contains of 3 uJ of the appropriate termination mix: 32 jiM c7dATP, 
32 u,M dCTP, 32 u,M c7dGTP, 32 \lM dTTP and 3.2 u.M of one of the four doTNPs, in 50 mM NaCI). The reaction 
mixtures were incubated at 37°C for 2 min. After the completion of extension, the beads were precipitated and the 
supernatant was removed. The beads were washed twice and resuspended in TE and kept at 4°C. 

Sequencing a 78-mer target 
[0328] Sequencing complex: 

5'AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGCTGCTGCT 

GCTGCTGGATGATCCGACGCATCAGATCTGG-(A b ) n -3'(SEQ ID NO. 25) 
(TNR.PLASM2) 

5'-CTGATGCGTCGGATCATC-3' (CM1) (SEQ ID NO. 26) 

[0329] The target TNR.PLASM2 was biotinylated and sequenced using procedures similar to those described in 
previous section (sequencing a 39-mer target). 
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Sequencing a 15-mer target with partially duplex probe 
[0330] Sequencing complex: 

5 '-F-GATGATCCGACGCATCACAGCTC 3 ' (SEQ ID No. 27) 
5 -TCGGTTCCAAGAGCTGTGATGCGTCGGATCATC-b- 3 ' {SEQ ID No. 28) 

[0331] CM1 B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, Norway) by incubating 60 pmol of 
CM1 B3B with 0.3 magnetic beads in 30 u.1 1 M NaCI and TE (1x binding and washing buffer) at room temperature for 
30 min. The beads were washed twice with TE and redlssoived in 30 \i\ TE, 10 or 20 uJ aliquot (containing 0.1 or 0.2 
mg of beads respectively) was used for sequencing reactions. 

[0332] The duplex was formed by annealing corresponding aliquot of beads from previous step with 10 pmol of 
DF11a5F (or 20 pmol of DF11a6F for 0.2 mg of beads) in a 9 uJ volume containing 2 uJ of 5x Sequenase buffer (200 
mM Trts-HCI, pH 7.5, 100 mM MgClg, and 250 mM NaCI) from the Sequenase kit. The annealing mixture was heated 
to 65°C and allowed to cool stowry to 37°C over a 20-30 min time period. The duplex primer was then mixed with 10 
pmol of TS10 (20 pmol of TS10 for 0.2 mg of beads) in 1 u.l volume, and the resulting mixture was further incubated 
at37°Cfor5min, room temperature for 5-1 0 min. Then 1 u.10.1 M dith loth reltoi solution, 1 uJ Mn buffer (0.15 M sodium 
isocitrate and 0.1 M MnCI 2 ), and 2 uJ of diluted Sequenase (3.25 units) were added. The reaction mixture was divided 
into four aiiquots of 3 u.1 each and mixed with termination mixes (each contains of 4 u»l of the appropriate termination 
mix: 16 |iM dATP, 16 u.M dCTP, 16 u.M dGTP, 1 6 uM dTTP and 1 .6 u,M of one of the four ddNTPs, in 50 mM NaCI). 
The reaction mixtures were incubated at room temperature for 5 min, and 37°C for 5 min. After the completion of 
extension, the beads were precipitated and the supernatant was removed. The beads were resuspended in 20 u.l TE 
and kept at 4°C. An aliquot of 2 u.l (out of 20 uJ) from each tube was taken and mixed with 8 uJ of formamide, the 
resulting samples were denatured at 90-95°C for 5 min and 2 ui (out of 1 0 uJ total) was applied to an ALF DNA sequencer 
(Pharmacia, Piscataway, NJ) using a 1 0% pofyacrylamide gel containing 7 M urea and 0.6xTBE. The remaining aliquot 
was used for MALDI-TOF MS analysis. 

MALDI sample preparation and instrumentation 

[0333] Before MALDI analysis, the sequencing ladder loaded magnetic beads were washed twice using 50 mM 
ammonium citrate and resuspended in 0.5 uJ pure water. The suspension was then loaded onto the sample target of 
the mass spectrometer and 0.5 uJ of saturated matrix solution (3-hydroxypicollnic acid (HPA); ammonium citrate =10: 
1 mole ratio in 50% acetonitrite) was added. The mixture was allowed to dry prior to mass spectrometer analysis. 
[0334] The reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, Bremen, Germany) was used for 
analysis. 5 kV was applied in the ion source and 20 kV was applied for postacceleration. All spectra were taken in the 
positive ion mode and a nitrogen laser was used. Normally, each spectrum was averaged for more than 100 shots and 
a standard 25-point smoothing was applied. 

RESULTS AND DISCUSSION 

Conventional solid-phase sequencing 

[0335] In conventional sequencing methods, a primer is directly annealed to the template and then extended and 
terminated in a Sanger dideoxy sequencing. Normally, a blotinyiated primer is used and the sequencing ladders are 
captured by streptavidin-coated magnetic beads. After washing, the products are eluted from the beads using EDTA 
and formamide. Previous findings indicated that only the annealed strand of a duplex is desorbed and the immobilized 
strand remains on the beads. Therefore, it is advantageous to immobilize the template and anneal the primer. After 
the sequencing reaction and washing, the beads with the immobilized template and annealed sequencing ladder can 
be loaded directly onto the mass spectrometer target and mix with matrix. In MALDI, only the annealed sequencing 
ladder will be desorbed and Ionized, and the Immobilized template will remain on the target. 

[0336] A 39-mer template (SEQ ID No. 23) was first blotinyiated at the 3'-end by adding blotln-1 4-dATP with terminal 
transferase. More than one blotin-1 4-dATP molecule could be added by the enzyme. Since the template was Immobi- 
lized and remained on the beads during MALDI, the number of biotin-1 4-dATP would not affect the mass spectra. A 
14-mer primer (SEQ ID No. 24) was used for the solid-state sequencing to generate DNA fragments 3-27 below (SEQ 
ID Nos. 142-166). MALDI-TOF mass spectra of the four sequencing ladders are shown in Figure 44 and the expected 
theoretical values are shown in Table III. 
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TABLE III 



— JS— : 

1 


S'-TCTGQCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA^A 6 )^ J 


2 


3 '-TCAAC ACTGC ATGT-5- Q 


3 


3'-ATCAACACTGCATGT-5' 


4 


3 '-C ATCAACACTG C ATGT-5' 


5 


3'-ACATCAACACTG C ATGT-5 ' 


6 


3'-AACATCAACACTGCATGT-5' 


8 7 


3'-TAACATCAACACTGC ATGT-5' 


8 


3'- ATAACATCAACACTGC ATGT-5' 


9 


3'«GATAACATCAACACTGC ATGT-5' 


10 


3'-GGATAACATCAACACTGCATGT-5' 


11 


3' CGGATAACATCAACACTGCATGT-5' 


12 


3'-CC GG ATA ACATCAAC ACTGC ATGT-5' 


| 13 


3'-CCCGGATAACATCAACACTG C ATGT-5 ' 


1 14 


3'-TCCCGGATAACATCAACACTGC ATGT-5 ' 




3' -GTCCCGG ATAAC ATC AAC ACTG C ATGT- 5 r 


16 


3'-CGTCCCGGATAACATCAACACTGCATGT-5 ' | 


17 


3'- ACGTCCCGGATAACATCAACACTGC ATGT-5* | 


18 


3'-CACGTCCCGG AT AACATCAAC ACTGC ATGT-5' J 


19 


3'-CCACGTCCCGG ATA ACATCAAC ACTGC ATGT-5 ' | 




o -ALUALu 1 t^CGGATAACATCAACACTGCATGT-5 1 


21 


3'-GACCACGTCCCGGATAACATCAACACTGCATGT-5' | 


22 


3'-GGACCACGTCCCGGATAACATCAACACTGCATGT-5' | 


23 


3*-CGGACCACGTCCCGGATAACATCAACACTGC ATGT-5' 


24 


3'-CCGGACC ACGTCCCGGATAACATCAACACTGC ATGT-5' 


25 


3'-ACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 


26 


3 # -GACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5' 


27 


3' - A GACCGG ACCACGTCC CG G ATA AC ATCAAC ACTGC ATG T- 5' 
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TABLE ill (Continued) 





1 A -rnnMiftis 




1 G-reaction 


= 1 ■ I" 
T-reaction 


1 










2 
*• 




I 4223.8 


I 4223.8 


4223.8 


o > 


1 ACOl t 








A 




| 4809.2 






R 


1 CI oo X 








1 A 
B 


1 dAQ/l o 








n 








5737.8 


II A 

u o. 


I DUO 1 . 1 








II o 

II y . 






6379.2 




fl in 






4 a^ a*. At a 

6704.4 




ii* 




6995.6 






1 9 




TOO. o 




• 






7574.0 : 






1 A 








7878.2 


1 «5 

13' 






8207.4 




16 1 










1 7 

1 


ftffOfl R i 
oouo.o f 












9097.0 1 






19. j 




9386.2 1 






20. 1 


9699.4 I 








21. j 






10027.6 




22. J 






10355.8 




23. | 




10644.0 I 






24. 1 




10933.2 1 






25. 1 


11246.4 I 
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A-reactton 


C-reactton 


G -react/on 


T-reaction 


26. 






11574.6 




». 


11886.8 









10 

[0337] The sequencing reaction produced a relatively homogenous ladder, and the full-length sequence was deter- 
mined easiiy. One peak around 5150 appeared In all reactions are not identified. A possible explanation is that a small 
portion of the template formed some kind of secondary structure, such as a loop, which hindered sequenase extension. 
Mis-incorporation is of minor importance .since the intensity of these peaks were much lower than that of the sequencing 
*s ladders. Although 7-deaza purines were used In the sequencing reaction, which could stabilize the N-glycosidic bond 
and prevent depurination, minor base losses were still observed since the primer was not substituted by 7-deazapu- 
rines. The full length ladder, with a ddA at the 3* end, appeared in the A reaction with an apparent mass of 11899.8. A 
more intense peak of 12333 appeared in ail four reactions and Is likely due to an addition of an extra nucleotide by the 
Sequenase enzyme. 

20 [0338] The same technique could be used to sequence longer DNA fragments. A 78-mer template containing a CTG 
repeat (SEQ ID No. 25) was 3'-biotinylated by adding btotin-1 4-dATP with terminal transferase. An 1 8-mer primer (SEQ 
ID No. 26) was annealed right outside the CTG repeat so that the repeat could be sequenced immediately after primer 
extension. The four reactions were washed and analyzed by MALDI-TOFMS as usual. An example of the G-reaction 
is shown in Figure 45 (SEQ ID Nos. 1 67-220) and the expected sequencing ladder is shown in Table IV with theoretical 

25 mass values for each ladder component. All sequencing peaks were well resolved except the last component (theo- 
retical value 20577.4) was Indistinguishable from the background. Two neighboring sequencing peaks (a 62-mer and 
a 63-mer) were also separated indicating that such sequencing anaiysis could be applicable to longer templates. Again, 
an addition of an extra nucleotide by the Sequenase enzyme was observed in this spectrum. This addition is not 
template specific and appeared in all four reactions which makes it easy to be identified. Compared to the primer peak, 

3o the sequencing peaks were at much lower intensity in the long template case. 
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ddATP 



TABLE IV Continued 



ddCTP 



ddGTP 



ddTTP 



1. 



2. 



5491.6 



5491.6 



5764.8 



5491.6 



5491.6 



3. 6078.0 



4. 



6407.2 



5. 



6. 



7009.6 



6696.4 



7. 



8. 



7628.0 



7338.8 



9. 



7941.2 



10. 



8270.4 



11. 



8559.6 



12. 8872.8 



13. 



9202.0 



14 



9491.2 



15. 9804.4 



16. 



10133.6 



17. 



10422.88 



18. 10736.0 



19. 



11065.2 



20. 



11354.4 



21. 11667.6 



22. 



23 



12286.0 



11996.8 



24. 12599.2 
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25. 



12928.4 



26. 
27. 

28. | 13835.0 



29. 



30. 



13521.8 



14124.2 



14453.4 



13232.6 



31. 



32. 



14742.6 



1 5046.8 



33. 15360.0 



34. 15673.2 



35. 



36. 



15962.4 



16251.6 



37. 



38. 



39. 



40. 



41. 



1 6894.0 



17207.2 



16580.8 



17800.6 



17511.4 



42. 



18189.8 



43. 



44. 



18379.0 



18683.2 



45. 



46. 



19012.4 



1 9341 .6 



47. 



19645.8 



48. 



19935.0 



49. | 20248.2 
50. 



20577.4 
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51. 


20890.6 








52. 








21 1 94.4 | 


53. 




21484.0 






54. 








21788.2 


| 55. 








22092.4 



Sequencing using duplex DNA probes for capturing and priming 

[0339] Duplex DNA probes with single-stranded overhang have been demonstrated to be able to capture specific 
DNA templates and also serve as primers for solid-state sequencing. The scheme is shown in Figure 46. Stacking 
interactions between a duplex probe and a single-stranded template allow only a 5-base overhang to be sufficient for 
capturing. Based on this format, a 5* fluorescent-labeled 23-mer (5*-GAT GAT CCG ACG CAT CAC AGC TC-3') (SEQ 
ID No. 29) was annealed to a S'-biotinytated 1 B-mer <5'-GTG ATG CGT CGG ATC ATC-3') (SEQ ID No. 30), leaving a 
5-base overhang. A 15-mer template (5'-TCG GTT CCA AGA GCT-3') (SEQ ID No. 31) was captured by the duplex 
and sequencing reactions were performed by extension of the 5-base overhang. MALDI-TOF mass spectra of the 
reactions are shown in Figure 47A-D. All sequencing peaks were resolved although at relatively low intensities. The 
last peak in each reaction is due to unspecific addition of one nucleotide to the full length extension product by the 
Sequenase enzyme. For comparison, the same products were run on a conventional DNA sequencer and a stacking 
f fuorogram of the results is shown in Figure 48. As can be seen from the Figure, the mass spectra had the same pattern 
as the fluorogram with sequencing peaks at much lower Intensity compared to the 23-mer primer. 

EXAMPLE 10 



Thermo Sequenase Cycle Sequencing 
MATERIALS AND METHODS 



[0340] PCR amplification. Human leukocytic genomic DNA was used for PCR amplification. PCR primers to amplify 
a 209 bp fragment of the 0-globln gene were the 02 forward primer (S'-CAT TTG CTT CTG ACA CAA CTG-3' SEQ ID 
NO. 32) and the p1 1 reverse primer (5'-CTT CTC TGT CTC CAC ATG C-3' SEQ JD NO. 33). Taq polymerase and 1 0x 
buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg. Germany). The 
total reaction volume was 50 at including 8 pmol of each primer with approximately 200 ng of genomic DNA used as 
template and a final dNTP concentration of 200 uW. PCR conditions were: 5 min at 94°C, followed by 40 cycles of 30 
sec at 94°C, 45 sec at 53*C. 30 sec at 72*C, and a final extension time of 2 min at 72°C. The generated amplified 
product was purified and concentrated (2x) with the Qiagen 'Qlaquick' PCR purification kit (#281 06) and stored in H 2 0. 
[0341 J Cycle Sequencing. Sequencing ladders were generated by primer extension with Thermo Sequenase™ -DNA 
Polymerase (Amersham LIFE Science, #E79000Y) under the following conditions: 7 pmol of HPLC purified primer 
(Cod5 1 2mer: 5'-TGC ACC TGA CTC-3* SEQ ID No. 34) were added to 6u.l purified and concentrated amplified product 
(I.e. 1 2uJ of the original amplified product), 2.5 units TTiermo Sequenase and 2.5 ml Thermo Sequenase reaction buffer 
in a total volume of 25u,l. The final nucleotide concentrations were 30u,M of the appropriate ddNTP (ddATP, ddCTP, 
ddGTP orddTTP; Pharmacia Biotech, #27-2045-01) and 210u,M of each dNTP (7-deaza-dATP, DCTP, 7-deaza-GTP 
dTTP; Pharmacia Biotech). 

[0342] Cycling conditions were: denaturation for 4 min at 94°C, followed by 35 cycles of 30 sec at 94°C 30 sec at 
38°C, 30 sec at 55°C, and a final extension of 2 min at 72*C. 

[0343] Sample preparation and analysis by MALDI-TOF MS. After completion of the cycling program, the reaction 
volume was Increased to 50u.l by addition of 25ulH 2 C. Desaitingwas achieved by shaking 30 uJ of ammonium saturated 
DOWEX (Fluka #44485) cation exchange beads with 50 u.1 of the analyte for 2 min at room temperature. The Dowex 
beads, purchased In the protonated form, were pre-treated with 2M NH 4 OH to convert them to the ammonium form, 
then washed with HgO until the supernatant was neutral, and finally put in 1 0 mM ammonium citrate for usage. After 
the cation exchange, DNA was purified and concentrated by ethanol precipitation by adding 5ui 3 M ammonium acetate 
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(pH 6.5), 0.5 jil glycogen (10 mg/ml, Sigma), and 110 jxl absolute ethanol to the analyte and incubated at room tem- 
perature for 1 hour. After 12 min centrifugal ion at 20,000 X g the pellet was washed In 70% ethanol and resuspended 
in 1 |il 18 Mohm/cm H 2 0 water. 

[0344] For MALDI-TOF MS analysis 0.35 u.l of resuspended DNA was mixed with 0.35-1 .3 u.l matrix solution (0.7 M 
3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H^Ch^CN) on a stainless steel sample target disk 
and allowed to air dry preceding spectrum acquisition using a Thermo Bfoanalysis Vision 2000 MALDI-TOF operated 
in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. External calibration generated 
from eight peaks (3000-1 8000 Da) was used for all spectra. 

RESULTS 

[0345] FIGURE 49 shows a MALDI-TOF mass spectrum of the sequencing ladder generated from a biological am- 
plified product as template and a 12mer (5*-TGC ACC TGA CTC-3'(SEQ ID NO.34)) sequencing primer. The peaks 
resulting from dep urinations and peaks which are not related to the sequence are marked by an asterisk. MALDI-TOF 
MS measurements were taken on a reflectron TOF MS. A,) Sequencing ladder stopped with ddATP; B.) Sequencing 
ladder stopped with ddCTP; C.) Sequencing iadder stopped with ddGTP; D.) Sequencing ladder stopped with ddTTP. 
[0346] FIGURE 50 shows a schematic representation of the sequencing ladder generated In Fig. 49 with the corre- 
sponding calculated molecular masses up to 40 bases after the primer (SEQ ID Nos 221-260). For the calculation the 
following masses were used: 3581 .4 Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for 
dCTP and 328.2 Da for 7-dea2a-dGTP. 

[0347] FIGURE 51 shows the sequence of the amplified 209bp amplified product within the 0-globin gene (SEQ ID 
No. 261 ), which was used as a template for sequencing. The sequences of the appropriate PCR primer and the location 
of the 12mer sequencing primer is also shown. This sequence represents a homozygote mutant at the position 4 after 
the primer. In a wildtype sequence this T would be replaced by an A. 

EXAMPLE 11 

Mlcrosatellite Analysis Using Primer Ollgo Base Extension (PROBE) and MALDI-TOF Mass Spectrometry 
SUMMARY 

[0348] The method uses a single detection primer followed by an oligonucleotide extension step to give products 
differing in length by a number of bases specific for the number of repeat units or for second site mutations within the 
repeated region, which can be easily resolved by MALDI-TOF mass spectrometry. The method is demonstrated using 
as a model system the AluVpA polymorphism in intron 5 of the interferon-a receptor gene located on human chromo- 
some 21 , and the poly T tract of the splice acceptor site of intron 8 from the CFTR gene located on human chromosome 
7. 

MATERIALS AND METHODS 

[0349] Genomic DNA was obtained from 1 8 unrelated individuals and one family including of a mother, father, and 
three children. The repeated region was evaluated conventionally by denaturing gel electrophoresis and results ob- 
tained were confirmed by standard Sanger sequencing. 

[0350] The primers for PCR amplification (8 pmol each) were IFNAR-IVS5-5': (5'-TGC TTA CTT AAC CCA GTG TG- 
3' SEQ ID. NO.35) and IFNAR-IVS5-3V2: (5'-CAC ACT ATG TAA TAC TAT GC-3' SEQ ID. NO.36) for a part of the 
intron 5 of the Interferon-a receptor gene, and CFEx9-F:(5'-GAA AAT ATC TGA CAA ACT CAT C-3 1 SEQ ID. NO .37) 
(5*-biotiny!ated) and CFExg-Rr^-CAT GGA CAC CAA ATT AAG TTC-3'SEQ ID. NO.38) for CFTR exon 9 with flanking 
intron sequences of the CFTR gene. Taq-porymerase including 1 0x buffer were purchased from Boehringer-Mannheim 
and dNTPs were obtained from Pharmacia. The total reaction volume was 50 u.1 . PCR conditions were 5 mln at 94°C 
followed by 40 cycles of: 1 mln at 94°C, 45 sec at 53°C, and 30 sec at 72°C, and a final extension time of 5 mln at 72*C. 
£0351] Amplification products were purified using Qlagen's PCR purification kit (No.28106) according to manufac- 
turer's instructions. Purified products were eluted from the column in 50 uJ TE-buffer (10mM Tris-HCI, 1 mM EDTA, pH 



A) Primer oligo base extension reaction (thermo cycling method 

[0352] CyclePROBE was performed with 5 pmol appropriate detection primer (IFN:5'-TGA GAC TCT GTC TC-3'SEQ 
ID. NO.39) in a total volume of 25 uJ including I pmol purified template, 2 units Thermosequenase (Amersham Life 
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Science, Cat. #E79000Y) 2.5 jjj Tnermosequenase buffer, 25 umol of each deoxynucleotide (7-deaza-dATP, dTTP, 
and in some experiments extra dCTP) and 100 ujtk>I of dideoxyguanine and in some experiments additional ddCTP. 
Cycling conditions: initial denalu ration 94*C for 5 min followed by 30 cycles with 44°C annealing temperature for 30 
sec and 55°C extension temperature for 1 min. 

Primer otigo base extension reaction (isothermal method) 

[0353] 1 o u.l aliquots of the purified double-stranded amplified product (-3 pmol) were transferred to a streptavidin- 
coated microliter plate well (-16 pmol capacity per 50 uJ volume; No. 1645684 Boehringer-Mannheim), followed by 
addition of 10 at incubation buffer (60 mM sodium phosphate, 400 mM NaCI, 0.4% Tween 20, pH 7.5) and 30 p.1 water. 
After incubation for 1 hour at room temperature, the wells were washed three times with 200 uJ washing buffer A (40 
mM Tris, 1 mM EDTA, 50 mM NaCI, 0.1 % Tween 20, pH 8.8) and Incubated with 1 00 uJ of 50 mM NaOH for 3 min to 
denature the double-stranded DNA. Finally, the wells were washed three times with 200 uJ 70 mM ammonium citrate 
solution. 

[0354] The annealing of 100 pmol detection primer (CFpT: S'-TTC CCC AAA TCC CTG-3' SEQ ID NO. 40) was 
performed In 50 U.I anneaJing buffer (50 mM ammonium phosphate buffer, pH 7.0 and 1 00 mM ammonium chloride) at 
65°C for 2 min, at 37°C for 10 min, and at room temperature for 10 min. The wells were washed three times with 200 
Hi washing buffer B (40 mM Tris, 1 mM EDTA, 50 mM NH 4 CJ, 0.1% Tween 20, pH 8.8) and once in 200 uJ TE buffer. 
The extension reaction was performed using some components of the DNA sequencing kit from USB (No. 70770) and 
dNTPs or ddNTPs from Pharmacia. Total reaction volume was 45 uJ, containing of 21 ul water, 6 u.l Sequenase-buffer, 
3 u.l 100 mM DTT solution, 50 umol of 7-dea*a-dATP, 20 umol ddCTP, 5.5 uJ glycerol enzyme dilution buffer, 0.25 uJ 
Sequenase 2.0, and 0.25 uJ pyrophosphatase. The reaction was pipetted on ice and incubated for 15 min at room 
temperature and for 5 min at 37°C. Finally^he wells were washed three times with 200 uJ washing buffer B. 
[0355] The extended primer was denatured from the template strand by heating at 80°C for 10 min in 50 u.l o' a 50 
mM ammonium hydroxide solution. 

[0356] For precipitation, 10 uJ 3 M NH ^-acetate (pH 6.5), 0.5 u.l glycogen (10 mg/ml water, Sigma. Cat.#G1 765), 
and 110 u,l absolute ethanol were added to the supernatant and incubated for 1 hour at room temperature. After cen- 
trif ugation at 1 3.000 g for 1 0 min the pellet was washed in 70% ethanol and resuspended in 1 \i\ 18 Mohm/cm H s O water. 
[0357] Sample preparation was performed by mixing 0.6 u.1 of matrix solution (0.7 M 3-hydroxyplcolinic acid, 0.07 M 
dibasic ammonium citrate in 1 :1 H 2 0:CH 3 CN) with 0.3 uJ of resuspended DNA/glycogen pellet on a sample target and 
allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of a 
Thermo BioanaJysis (formerly Finntgan) Visions 2000 MALDI-TOF operated In reftectron mode with 5 and 20 kV on 
the target and conversion dynode, respectively. Theoretical average molecular mass (M r (calc)) were calculated from 
atomic compositions; reported experimental M r (M r (exp)) values are those of the singfy-protonated form, determined 
using external calibration. 

RESULTS 

[0358] The aim of the experiments was to develop a fast and reliable method for the exact determination of the 
number of repeat units in microsatellites or the length of a mononucleotide stretch including the potential to detect 
second site mutations within the polymorphic region. Therefore, a special kind of DNA sequencing (primer oligo base 
extension, PROBE) was combined with the evaluation of the resulting products by matrix-assisted laser desorptfon 
ionization (MALDI) mass spectrometry (MS). The Urne-of-flight (TOF) reflectron arrangement was chosen-as a possible 
mass measurement system. As an initial feasibility study, an examination was performed first on an AluVpA repeat 
polymorphism located in intron 5 of the human interferon-a receptor gene (cyclePROBE reaction) and second on the 
poly T tract located In intron 8 of the human CFTR gene (isothermal PROBE reaction). 

[0359] A schematic presentation of the cyclePROBE experiment for the AluVpA repeat polymorphism is given in 
Figure 52. The extension of the antisense strand (SEQ ID No. 262) was performed with the sense strand serving as 
the template. The detection primer is underlined. In a family study co-dominant segregation of the various alleles could 
be demonstrated by the eiectrophoretic procedure as well as by the cydePROBE method followed by mass spec 
analysis (Figure 53). Those alleles of the mother and child 2, for which direct electrophoresis of the amplified product 
indicated one of the two copies to have 13 repeat units.were measured using cyclePROBE to have instead only 11 
units using ddG as terminator. The replacement of ddG by ddC resulted In a further unexpected short allele with a 
molecular mass of approximately 11 650 in the DNA of the mother and child 2 (Figure 54). Sequence analysis verified 
this presence of two second site mutations in the aiiele with 13 repeat units. The first Is a C to T transition In the third 
repeat unit and the second mutation is a T to G transversion in the ninth repeat unit. Examination of 28 unrelated 
Individuals shows that the 13 unit allele is spliced into a normal allele and a truncated aiiele using cyclePROBE. Sta- 
tistical evaluation shows that the polymorphism is in Hardy-Weinberg equilibrium for both methods, however, using 
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cyclePROBE as detection method the polymorphism information content is increased to 0.734. 
[0360] PROBE was aJso used as an isothermic method for the detection of the three common alleles at the intron 8 
splice acceptor site of the CFTR gene {SEQ ID No. 263). Figure 55 shows a schematic presentation of the expected 
diagnostic products (SEQ ID Nos. 264-266) with the theoretical mass values. The reaction was also performed In the 
antisense direction. 

[0361] Figure 56 demonstrates that aU three common alleles (T5, T7, and T9, respectively) at this locus could be 
reliably disclosed by this method. Reference to Figure 56 Indicates that mass accuracy and precision with the reflectron 
time of flight used in this study ranged from 0-0.4%, with a relative standard deviation of 0.13%. This corresponds to 
far better than single base accuracy for the up to <90-mer diagnostic products generated in the IFNAR system. Such 
high analytical sensitivity is sufficient to detect single or multiple insertion/deletion mutations within the repeat unit or 
its flanking regions, which would Induce > 1 % mass shifts in a 90-mer. This is analogous to the Figure 56 polyT tract 
analysis. Other mutations (i.e. an A to T or a T to A mutation within the IFNAR gene A3T repeat) which do not cause 
premature product termination are not detectable using any dNTP/ddNTP combination with PROBE and low perform- 
ance MS instrumentation; a 9 Da shift in a 90-mer corresponds to a 0.03% mass shift. Achieving the accuracy and 
precision required to detect such minor mass shifts has been demonstrated with higher performance instrumentation 
such as Fourier transform (FT)MS, for which single Da accuracy is obtained up to 100-mers. Further, tandem FTMS, 
in which a mass shifted fragment can be isolated within the instrument and dissociated to generate sequence specific 
fragments, has been demonstrated to locate point mutations to the base in comparably sized products. Thus the com- 
bination of PROBE with higher performance instrumentation will have an analytical sensitivity which can be matched 
only by cumbersome full sequencing of the repeat region. 

EXAMPLE 12 

Improved Apo lipoprotein E Genotyplng Using Primer Oligo Base Extension (PROBE) and MALDI-TOF Mass 
Spectrometry 

MATERIALS AND METHODS 

PCR amplification. 

[0362J Human leukocytic genomic DNA from 100 anonymous individuals from a previously published study (Braun, 
A et a/., (1992) Human Genet. 89 :401 -406) were screened for apolipoprotein E genotypes using conventional methods. 
PCR primers to amplify a portion of exon 4 of the apo E gene were delineated according to the published sequence 
(Das, HK et aL, (1985) J. Bioi. Chem. 260 :6240-6247) (forward primer, apoE-F: 5*«GGC ACG GCT GTC CAA GGA G- 
3'SEQ ID. NO.41 ; reverse, apoE-R: 5'-AGG CCG CGC TCG GCG CCC TC-3'SEQ ID. NO.42). Taq polymerase and 
10x buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). 
The total reaction volume was 50 u.L including 8 pmol of each primer and 10% DMSO (dim ethylsulf oxide, Sigma) with 
approximately 200 ng of genomic DNA used as template. Solutions were heated to 80°C before the addition of 1U 
polymerase; PCR conditions were: 2 mln at 94°C, followed by 40 cycles of 30 sec at 94*C, 45 sec at 63"C, 30 sec at 
72°C, and a final extension time of 2 mln at 72°C. 

Restriction enzyme digestion and potyacrylamide electrophoresis. 

[0363] Cfol and Rsal and reaction buffer L were purchased from Boehringer-Mannheim, and Hhal from Pharmacia 
(Freiburg, Germany). For Cfol alone and simultaneous Cfol/Rsal digestion, 20 pL of amplified products were diluted 
with 15 uJ water and 4 pL Boehringer-Mannheim buffer L; after addition of 10 units of appropriate restriction enzyme 
(s) the samples were incubated for 60 min at 37°C. The procedure for simultaneous Hhat/Rsal digestion required first 
digestion by Rsal in buffer L for one hour followed by addition of NaCI (50 mM end concentration) and Hhal, and 
additional Incubation for one hour. 20 pL of the restriction digest were analv2ed on a 12% poJyacrylamide gel as de- 
scribed elsewhere (Hixson (1990) J. Lipid Res. 31:545-548). Recognition sequences of Rsal and Cfol (Hhal) are GT/ 
AC and GCG/C, respectively; masses of expected digestion fragments from the 252-mer ampiified product with Cfol 
alone and the simultaneous double digest with Cfol (or Hhal) and Rsal are given in Table V. 

Thenrto-PROBE. 

[0364] PCR amplification was performed as described above, but with products purified with the Qiagen' Qiaquick' 
kit to remove unincorporated primers. Multiplex Thermo-PROBE was performed with 35 uJ amplified product and 8 
pmol each of the codon 112 (5'-GCG GAC ATG GAG GAC GTG-3' SEQ ID. NO. 43) and 158 (5'-GAT GCC GATGAC 
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CTG CAG AAG-3' SEQ ID. NO .44) detection primers in 20 uJ including — 1 pmol purified biotinylated antfsense template 
immobilized on streptavidln coated magnetic beads, 2.5 units Therm osequenase, 2uJ Thermosequenase buffer, 50 
u.M of each dNTP and 200 p.M of ddXTP, with the base identity of N and X as described in the text. Cycling conditions 
were: denaturation (94°C, 30 sec) followed by 30 cycles at 94*C (10 mln) and 60°C (45 sec). 

Sample preparation and analysis by MALD/-TOF MS. 

[0365] For precipitation (Stufts era/., (1 991) Rapid Commun. Mass Spectrom. 5: 359-363) of both digestsand PROBE 
products, 5u.l 3M ammonium acetate (pH 6.5), 0.5uJ glycogen (1 0 mg/ml, Sigma), and 110 uJ absolute ethanol were 
added to 50 uJ of the analyte solutions and stored for 1 hour at room temperature. After 1 0 mln centrifugation at 13,000 
X g the pellet was washed in 70% ethanol and resuspended in 1 uJ 1 8 Mohm/cm H 2 0 water Where noted in the text, 
additional desalting was achieved by shaking 10-20uX of ammonium saturated DOWEX (Fluka #44485) cation ex- 
change beads in 40u.L of analyte. The beads, purchased in the protonated form, were pre-treated with three 5 min 
spin-decant steps in 2M NH^OH, followed with H 2 0 and 10 mM ammonium citrate. 

[0366] 0.35uX of resuspended DNA was mixed with 0.35-1 .3ul- matrix solutions (Wu et al. (1 993) Rapid Commun. 
Mass Spectrom. 7:1 42-146) 0.7 M 3-hydroxypjcolinic acid (3-HPA), 0.07 M ammonium citrate in 1 :1 H 2 0:CH 3 CN) on 
a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using a Thermo Bloanalysis 
Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respec- 
tively. Theoretical average molecular masses (M r (calc)) of the fragments were calculated from atomic compositions; 
the mass of a proton (1 .08 Da) is subtracted from raw data values in reporting experimental molecular masses (M r 
(exp)) as neutral basis. An external calibration generated from eight peaks (3000-18000 Da) was applied to all spectra. 

RESULTS 

Digestion with Cfol alone. 

[0367] The inset to Figure 57a shows a 12% polyacrylamide gel electrophoretic separation of an G3/G3 genotype 
after digestion of the 252 bp apo E amplified product with CfoL Comparison of the electrophoretic bands with a molecular 
weight ladder shows the cutting pattern to be as mostly as expected (Table V) for the G3/G3 genotype. Differences 
are that the faint band at approximately 25 bp is not expected, and the smallest fragments are not observed. The 
accompanying mass spectrum of precipitated digest products shows a similar pattern, albert at higher resolution. Com- 
parison with Table V shows that the observed masses are consistent with those of single-stranded DNA; the combination 
of an acidic matrix environment (3-HPA, pK a 3) and the absorption of thermal energy via interactions with the 337 nm 
absorbing 3-HPA upon ionization is known to denature short stretches of dsDNA under normal MALDJ conditions (Tang, 
K f*L« 0 Rapid Commun Mass Spectrom 8:1 83-1 86). 

[0368] The approximately 25-mers, unresolved with electrophoresis, are resolved by MS as three single stranded 
fragments; while the largest (7427 Da) of these may represent a doubly charged ion from the 14.8 kDa fragments (m 
- 1 4850, z = 2; m/z = 7425), the 671 5 and 71 53 Da fragments could result from PCR artifacts or primer impurities; all 
three peaks are not observed when amplified products are purified with Qiagen purification kits prior to digestion. The 
Table V 8871 Da 29-mer sense strand 3*-termlnal fragment is not observed; the species detected at 9186 Da is con- 
sistent with the addition of an extra base (9187 - 8871 = 316, consistent with A) by the Taq-polymerase during PCR 
amplification (Hu, G et ah, (1993) DNA and Cell Biol 12:763-770). The individual single strands of each double strand 
with <35 bases (1 1 kDa) are resolved as single peaks; the 48-base single strands (M^cafc) 1 4845 and 1 4858), however, 
are observed as an unresolved single peak at 14850 Da. Separating these into single peaks would require a mass 
resolution (m/Am, the ratio of the mass to the peak width at half height) of 1 4850/13 = 1140, nearly an order of magnitude 
greater than what is routine with the standard reflectron time-of-filght instrumentation used in this study; resolving such 
small mass differences with high performance Instrumentation such as Fourier transform MS, which provides up to 
three orders of magnitude higher resolution in this mass range, has been demonstrated. The 91-mer single strands 
(M r (calc)27849 and 28436) are also not resolved, even though this requires a resolution of only < 50. The dramatic 
decrease In peak quality at higher masses is due to metastable fragmentation (I.e. depurlnation) resulting from excess 
internal energy absorbed during and subsequent to laser irradiation. 

Simultaneous digestion with Cfol andRsal. 

[0369] Figure 57b (inset) shows a 12% polyacryiamide gel electrophoresis separation of £3/E3 double digest prod- 
ucts, with bands consistent with dsDNA with 24, 31, 38, 4B, and 55 base pairs, but not for the smaller fragments. 
Although more peaks are generated (Table V) than with Cfol alone, the corresponding mass spectrum is more easily 
interpreted and reproducible since all fragments contain < 60 bases, a size range far more appropriate for MALDI-MS 
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If reasonably accurate M r values (e.g. , 0.1%) are desired. For fragments in this mass range, the mass measuring 
accuracy using external calibration is -0.1% (i.e. < + 10 Da at 10 kDa). Significant depurination (indicated in Figure by 
asterisk) is observed for all peaks above 10 kDa, but even the largest peak at 17171 Da is clearly resolved from its 
depurination peak so that an accurate M r can be measured. Although molar concentrations of digest products should 
be identical, some discrimination against those fragments with <11 bases is observed, probably due to their loss in the 
ethanol/glycogen precipitation step. The quaiity of MS results from simultaneous digestion with Cfol (or Hhal) and Rsal 
is superior to those with Cfol (or Hhal) alone, since the smaller fragments generated are good for higher mass accuracy 
measurements, and with ail genotypes there is no possibility for dimer peaks overlapping with high mass diagnostic 
peaks. Since digestion by Rsal/Cfol and RsatrtHha! produce the same restriction fragments but the former may be 
performed as a simultaneous digest since their buffer requirements are the same, this enzyme mixture was used for 
all subsequent genotypfng by restriction digest protocols. 
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Mass and Copy Number of Expected Restriction Digest Products |j 


Table Va Cfol Digestion 
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(+) (-) 


e2/e2 


e2/e2 


e2/e2 
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e2/e2 
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1 


2 


10752, 10921 




1 


1 


2 


2 


2 


14845, 14858 




1 


1 


2 


2 


2 


22102, 22440 


mm 




1 




1 


2 


25575, 25763 


2 


1 


1 








27849, 28436 


Z 


2 


1 


2 


1 




Table Vb. Cfol/Rsai Dige 
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(+) (■) 


e2/e2 


e2/e3 


e2/e4 


e3/e3 


a3/e4 


e4/e4 


[ 3428, 4025 
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1 


2 


2 


2 


j 5283, 5880 






1 




1 


2 | 


5781 , 5999 






1 




1 


2 1 


11279, 11627 


2 


2 


1 


2 


1 




14B45, 14858 




1 


1 


2 


2 


2 


18269, 16848 


2 


2 


1 
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_ - i 



20 



25 



30 



35 



40 



„ - — i ...... • ™-i -»wfc-r» -wr, -J-r i ~ru, Ofl f I , so<0 UU. 

b CfoL / Rsal ^variant fragment meases: 1848, 2177, 2186, 2436, 4024, 5004. 5412, 5750. 6745. 7510. 8871 . 9628. 16240, 17175 Da. 



Table VI 
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doT M r (Calc) 


ddT M r (Exp) 


ddC M 5 (Calc) 


ddC M r (Exp) 


e2/e2 


a 591 8, b 6768 




•6536, b 7387 




e2/e3 


•5918, *$7$B*1KG& 


5919, 6769, 7967 


•6536, 06753, b 7387 


6542, 6752, 7393 


e2/e4 


•5918, bBrBB, b 7965, «8970 




•5903, b 6536, b 6753, «7387 




J e3/e3 


•5918. b 7965 


5918, 7966 


•6536, ^753 


6542, 6756 


e3/e4 


•5918, b 7965, •8970 


5914, 7959, 8965 


"5903, 06536, b 6753 


5898, 6533, 6747 


e4/e4 


b 7985. •8970 


7966, 8969 


•5903, b 6753 


5900, 6752 



45 



50 



55 



b From codon 158 detection prfcner (unextonded G4S0.3 Da). 

Dashed Ines: this genotype not available from the analyzed pooJ of 1 00 patients. 



[0370] Figure 58a-c shows the ApoE G3/G3 genotype after digestion with CfoJ and a variety of precipitation schemes; 
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equal volume aliquots of the same amplified product were used for each. The sampfe treated with a single precipitation 
(Figure 58a) from an ammonium acetate/ethanol/gtycogen solution results in a mass spectrum characterized by broad 
peaks, especially at high mass. The masses for intense peaks at 5.4, 10.7, and 14.9 kDa are 26 Da (0.5%), 61 Da 
(0.6%), and 45 Da (0.3%) Da higher, respectively, than the expected values; the resolution (the ratio of a peak width 
at haif its total intensity to the measured mass of the peak) for each of these Is -50, and decreases with increasing 
mass. Such observations are consistent with a high level of nonvolatile cation adduction; for the 10.8 kDa fragment, 
the observed mass shift is consistent with a greater than unit ratio of adducted:nonadducted molecular Ions. 
£0371 J MS peaks from a sample redissoived and precipitated a second time are far sharper (Figure 58b), with res- 
olution values nearly double those of the corresponding Figure 58a peaks. Mass accuracy values are also considerably 
improved; each is within 0.07% of its respective calculated values, close to the independently determined instrumental 
limits for DNA measurement using 3-HPA as a matrix. Single (not shown) and double (Figure 58C) precipitations with 
isopropyl alcohol (IPA) instead of ethanoi result in resolution and mass accuracy vatues comparable to those for cor- 
responding ethanoi precipitations, but enhanced levels of dimerization are observed, again potentially confusing meas- 
urements when such dimers overlap with higher mass "diagnostics" monomers present In the solution. EtOH/ammo- 
nlum acetate precipitation with glycogen as a nucieatlon agent results in nearly quantitative recovery of fragments 
except for the 7-mers, serving as a simultaneous concentration and desalting step prior to MS detection. Precipitation 
from the same EtOH/ammonium acetate solutions in the absence of glycogen 'results in far poorer recovery, especially 
at low mass. 

[Q372J The results indicate that to obtain accurate (M r (exp) values after either 1 PA and EtOH precipitations, a second 
precipitation is necessary to maintain high mass accuracy and resolution. 

[0373] The ratio of matrix:digest product also affects spectral quality; severe suppression of higher mass fragments 
(not shown) observed with 1:1 volume matrix: digest product (redissoived in 1u,L) is alleviated by using a 3 - 5 fold 
volume excess of matrix. 

[03741 Af>o E genotyping by enzymatic digestion, Codon 112 and 158 polymorphisms fall within Cfol (but not Rsal) 
recognition sequences. In the 252 bp amplified product studied here, invariant (i.e. cut in ail genotypes) sites cause 
cuts after bases 31 , 47, 1 38, 1 56, 239, and 246. The cutting site after base 66 is only present for G4, while that after 
base 204 is present in E3 and G4; the G2 genotype is cut at neither of these sites. These differences in the restriction 
pattern can be demonstrated as variations in mass spectra. Figure 59 shows mass spectra from several ApoE geno- 
types available from a pool of 100 patients (Braun, A etai, (1992) Hum. Genet 89:401 ^408). Vertical dashed lines are 
drawn through those masses corresponding to the expected Table V diagnostic fragments; other labeled fragments 
are invariant. Referring to Table V, note that a fragment is only considered "invariant" if it is present In duplicate copies 
for a given allele; to satisfy this requirement, such a fragment must be generated in each of the S2m G3, and G4 alleles. 
[0375] The spectrum in Figure 59a contains all of the expected Invariant fragments above 3 kDa, as well as diagnostic 
peaks at 3428 and 4021 (both weak), 11276 and 11627 (both intense), 14845, 18271, and 18865 Da. The spectrum 
In Figure 59b is nearly identical except that the pair of peaks at 1 8 kDa is not detected, and the relative peak Intensities, 
most notably among the 11 -18 kDa fragments, are different. The spectrum in Figure 59c also has no 1 8 kDa fragments, 
but instead has new low intensity peaks between 5-6 kDa. The Intensity ratios for fragments above 9 kDa are similar 
to those of Figure 59b except for a relatively lower 11 kDa fragment pair. Figure 59d, which again contains the 5-6 kDa 
cluster of peaks, is the only spectrum with no 1 1 kDa fragments, and like the previous two also has no 1 8 kDa fragment. 
[0376] Despite the myriad of peaks in each spectrum, each genotype can be identified by the presence and absence 
of only a few of the Table Vb diagnostic peaks. Due to the limited resolution of the MALDI-TOF Instrumentation em- 
ployed, the most difficult genotypes to differentiate are those based upon the presence or absence of the four diagnostic 
fragments between 5.2 and 6.0 kDa characteristic of the G4 allele, since these fragments nearly overlap with severai 
Invariant peaks. It has been found herein that the 5283 Da diagnostic fragment overlaps with a depurination peak from 
the 5412 Da invariant fragment, and the 5781 Da diagnostic peak fs normally not completely resolved from the 5750 
Da Invariant fragment. Thus, distinguishing between an G2/G4 and G2/G3, or between an G3/G4 and an G3/G3 allele, 
relies upon the presence or absence of the 5B80 and 5999 Da fragments. Each of these is present in Figures 59c and 
59d, but not in 59a or 59b. 

[0377] The genotype of each of the patients in Figure 59 can be more rapidly identified by reference to the flowchart 
In Figure 60. Consider the Figure 59a spectrum. The intense pair of peaks at 11 kDa discounts the possibility of ho- 
mozygous G4, but does not differentiate between the other five genotypes. Likewise, the presence of the unresolved 
14.8 kDa fragments is inconsistent with homozygous G2, but leaves four possibilities (G2/G3, G2/G4, e3/G3, G3/G4). 
Of these only G2/E3 and G2/G4 are consistent with the 18 kDa peaks; the lack of peaks at 5283, 5879, 5779, and 
5998, Da Indicate that the Rgure 59a sample Is G2/G3. Using the same procedure, the Figures 59b-d genotypes can 
be Identified as G3/G3, G3/G4, and G4/G4, respectively. To date, all allele identifications by this method have been 
consistent with, and In many cases more easily interpreted than, those attained via conventional methods. The assign- 
ment can be further confirmed by assuring that fragment intensity ratios are consistent with the copy numbers of Table 
V. For instance, the 1 4.B kDa fragments are of lower intensity than those at 1 6- 1 7 kDa in Figure 59a, but the opposite 
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is seen In Figures 59b-d. This is as expected, since In the latter three genotypes the 14.8 kDa fragments are present 
in duplicate, but the first Is a heterozygote containing e2, so that half of the amplified products do not contribute to the 
14.8 kDa signal. Likewise, comparison of the 11 kDa fragment intensify to those at 9.6 and 14.8 kDa indicate that this 
fragment is double, double, single, and zero copy in Figures 59a, d, respectively. These data confirm that MALDI can 
perform In a semi-quantitative way under these conditions. 

[0378] ApoE genotyping by Primer Otigo Base Extension (PROBE). The PROBE reactio n was also tested as a means 
of simultaneous detection of the codon 112 and 1 58 polymorphisms. A detection primer is annealed to a single-stranded 
PCR-amplified template so that its 3' terminus is just downstream of the variable site. Extension of this primer by a 
DNA polymerase in the presence of three dNTPs and one ddXTP (that is not present as a dNTP) results in products 
whose length and mass depend upon the identity of the polymorphic base. Unlike standard Sanger type sequencing, 
in which a particular base-specific tube contains -99% dXTP and -1% ddXTP, the PROBE mixture contains 100% of 
a particular ddXTP combined with the other three dNTPs. Thus with PROBE a full stop of all detection primers is 
achieved after the first base complementary to the ddXTP is reached. 

[0379J For the E2/G3 genotype, the PROBE reaction (mixture of ddTTP, dATP, dCTP, dGTP) causes a M r (exp) shift 
of the codon 112 primer to 5919 Da, and of the codon 158 primer to 6769 and 7967 Da (Table VI); a pair of extension 
products results from the single codon 1 58 primer because the G2/G3 genotype is heterozygous at this position. Three 
extension products (one from codon 158, two from 112) are also observed from the heterozygote G3/G4 (Figure 61c 
and Table VI), while only two products (one from each primer) are observed from the Figure 61b (G3/G3) and Figure 
59d (e4/G4) homozygote alleles. Referring to Table VI, each of the available alleles result in ail expected ddT reaction 
product masses within 0.1% of the theoretical mass, and thus each is unambiguously characterized by this data alone. 
Further configuration of the allele identities may be obtained by repeating the reaction with tSdCTP (plus dATP, oTTP, 
dGTP); these results, summarized also in Table VI, unambiguously confirm the ddT results. 

[0380] Appropriateness of the methods. Comparison of Figures 59 (restriction digestion) and 61 (PROBE) indicates 
that the PROBE method provides far more easily interpreted spectra for the multiplex analysis of codon 112 and 158 
polymorphisms than does the restriction digest analysis. While the digests generate up to -25 peaks per mass spectrum 
and in some case diagnostic fragments overlapping with invariant fragments, the PROBE reaction generates a maxi- 
mum of only two peaks per detection primer (i.e. polymorphism). Automated peak detection, spectrum analysis, and 
allele identification would clearly be far more straightforward for the latter. Spectra for highly multiplexed PROBE, in 
which several polymorphic sites from the same or different amplified products are measured from one tube, are also 
potentially simple to analyze. Underscoring its flexibility, PROBE data analysis can be further simplified by Judicious a 
priori choice of primer lengths, which can be designed so that no primers or products can overlap in mass. 
[0381] Thus while PROBE is the method of choice for large scale clinical testing of previously well characterized 
polymorphic sites, the restriction digest analysis as described here is ideally suited to screening for new mutations. 
The identity of each of the two polymorphisms discussed in this study affects the fragment pattern; if this is the only 
information used, then the MS detection is a faster alternative to conventional electrophoretic separation of restriction 
fragment length polymorphism products. The exact measurement of fragment M r values can also give information on 
about sites completely remote from the enzyme recognition site since other single point mutations necessarily alter 
the mass of each of the single strands of the double stranded fragment containing the mutation. The 252 bp amplified 
product could aJso contain allelic variants resulting in, for example, previously described Glyl27 Asp (Weisgraber, KH 
et ai, (1984) J. Clin, invest. 73:1024-1033), Argl36Ser (Wardell, MR et ai., (1987) J. Clin. Invest. 80:483-490), 
Argl42Cys (Horie, Y et a!., (1992) J. Biol. Chem. 267:1962-1968), Arg145Cys (Rail SC Jr et aL, (1982) Proc. Natl. 
Acad. Sci. U.S.A. 79 :4696-4700), Lysl46Glu (Mann. WA et aL, (1 995) J. Clin . Invest. 96 : 1 1 00-1 1 07), orl_ysl46Gln (Smrt, 
M et ai., (1 990) J. Lipid Res. 31 :45-53) substitutions. The G->A base substitution which codes for the Glyl27 Asp amino 
acid substitution would result in a -1 6 Da shift in the sense strand, and in a + 1 5 Da (C->T) shift in the antisense strand, 
but not in a change in the restriction pattern. Such a minor change would be virtually invisible by electrophoresis; 
however, with accurate mass determination the substitution could be detected; the invariant 55-mer fragment at 1 6240 
(sense) and 1 71 75 Da would shift to 1 6224 and 1 71 90 Da, respectively. Obtaining the mass accuracy required to detect 
such minor mass shifts using current MALDI-TOF instrumentation, even with internal calibration, Is not routine since 
minor unresolved adducts and/or poorly defined peaks limit the ability for accurate mass calling. With high performance 
electrospray ionization Fourier transform (ESI-FTMS) single Da accuracy has been achieved with synthetic oligonu- 
cleotides (Little, DP et a/., (1 995) Proc. Natl. Acad. Sci. U.S.A. 92 :231 8-2322) up to 1 00-mers (Little, DP ef a/., (1 994) 
J. Am. Chem. Soc. 116 :4893-4897). and similar results have recently been achieved with up to 25-mers using MALDJ- 
FTMS (Li, Y ef a/., (1 996) Anai. Chem. 68 :2090-2096). 
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EXAMPLE 13 

A Method for Mass Spectrometry Detection of ON A Fragments Associated With Telomeres e Activity 

5 INTRODUCTION 

[0362] One-fourth of all deaths in the United States are due to malignant tumors (R.K. Jain, (1996) Science 271 : 
1079-1 080). For diagnostic and therapeutic purposes there is a high interest in reliable and sensitive methods of tumor 
cell detection. 

10 [0383] Malignant cells can be distinguished from normal cells by different properties. One of those is the immortali- 
zation of malignant cells which enables uncontrolled cell-proliferation. Normal diploid mammalian ceils undergo a finite 
number of population doublings In culture, before they undergo senescence. It is supposed that the number of popu- 
lation doublings in culture, before they undergo senescence. It is supposed that the number of population doublings 
is related to the shortening of chromosome ends, called telomere, in every cell division. The reason for said shortening 

15 is based on the properties of the conventional semiconservative replication machinery. DNA polymerases only work 
in 5* to 3' direction and need an RNA primer. 

[0384] Immortalization is thought to be associated with the expression of active telomerase. Said teiomerase is a 
ribonucleoprotein catalyzing repetitive elongation of templates. This activity can be detected in a native protein extract 
of telomerase containing cells by a special PCR-system (N.W. Kim et aj. (1994) Science 266:2011-2015) known as 
20 telomeric repeat amplification protocol (TRAP). The assay, as used herein, is based on the telomerase specific exten- 
sion of a substrate primer (TS) and a subsequent amplification of the telomerase specific extension products by a PCR 
step using a second primer (bioCX) complementary to the repeat structure. The characteristic ladder fragments of 
those assays are conventionally detected by the use of gel electrophoretic and labeling or staining systems. These 
methods can be replaced by MALDI-TOF mass spectrometry leading to faster accurate and automated detection. 

25 

MATERIALS AND METHODS 

Preparation of cells 

30 [0385] 1 x 1 0 s cultured telomerase-positive cells were pelleted, washed once with PBS (137 mM NaCI, 2.7 mM KCI, 
4.3 mM Na 2 HP0 4 »7H 2 0, 1 .4 mM KH 2 P0 4 in sterile DEPC water). The prepared cells may be stored at -75°C. Tissue 
samples have to be homogenized, according to procedures well known in the art, before extraction. 

Telomerase extraction 

35 

[0386] Pellet was resuspended in 200 u.1 CHAPS lysis buffer (10 mM Tris-HCI pH 7.5, 1 mM MgCfe, 1 mM EGTA, 
0.1 mM benzamidine, 5 mM p-mercaptoethanol r 0.5% CHAPS, 1 0% glycerol) and incubated on ice for 30 mln. The 
sample was centrifuged at 12,000 g for 30 min at 4°C. The supernatant was transferred into a fresh tube and stored 
at 75°C until use. 

40 

THAP-assay 

[0387] 2 p.I of telomerase extract were added to a mixture of 10x TRAP buffer (200 mM Tris-HCI pH 8.3, 15 mM 
MgCfe, 630 mM KCI, 0.05% Tween 20, 10 mM EGTA) 50x dNTP-mix (2.5 mM each dATP, dTTP, dGTP, and dCTP), 
10 pmol of TS primer and 50 pmol of bio CX primer in a final volume of 50 u,l. The mixture was incubated at 30°C for 
10 minutes and 5 min. at 94°C, 2 units of Taq Polymerase were added and a PCR was performed with 30 cycles of 
94°C for 30 seconds, 50°C for 30 seconds and 72°C for 45 seconds. 

Purification of TRAP-assay products 

50 

[0388] For every TRAP-assay to be purified, 50 u.1 Streptavidin M-280 Dynabeads (10 mg/ml) were washed twice 
with 1x BW buffer (5 mM Tris-HCI, pH 7.5, 0.5 mM EDTA, 1 M NaCI). 50 u.l of 2x BW buffer were added to the PCR 
mix and the beads were resuspended In this mixture. The beads were incubated under gentle shaking for 15 min. at 
ambient temperature. The supernatant was removed and the beads were washed twice with 1x BW buffer. To the 
55 beads 50 u.1 25% ammonium hydroxide were added and incubated at 60'C for 10 min. The supernatant was saved, 
the procedure repeated, both supernatants were pooled and 300 u.l ethanol (1 00%) were added. After 30 min. the DNA 
was pelleted at 13,000 rpm for 12 min., the pellet was air-dried and resuspended in 600 nl ultrapure water. 
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MALDf-TOF MS of TRAP-assay products 

[0389] 300 nl sample were mixed with 500 nl of saturated matrix-solution (3-HPA:ammonium citrate = 10:1 molar 
ratio in 50% aqueous acetonitrile), dried at ambient temperature and introduced into the mass spectrometer (Vision 
5 2000, Finigan MAT). All spectra were collected in reflector mode using external calibration. 

Sequences and masses 

[0390] 

10 

bioCX: d(bio-CCC TTA CCC TTA CCC TTA CCC TAA SEQ ID NO. 45), mass: 7540 Da. 
TS: d(AAT CCG TGC AGC AGA GTT SEQ ID NO.46), mass: 5523 Da. 
Telomeric- repeat structure: (TTAGGG) n , mass of one repeat: 1909.2 Amplification products: 
TS elongated by three telomeric repeats (first amplification product): 12452 Da. (N3) 
is TS elongated by four telomeric repeats: 14361 Da. (N 4 ) 

TS elongated by seven telomeric repeats: 20088 Da. (Ny) 

RESULTS 

20 [0391] Figure 62 depicts a section of a TRAP -assay MALDI-TOF mass spectrum. Assigned are the primers TS and 
bioCX at 5497 and 7537 Da, respectively (calculated 5523 and 7540 Da). The signal marked by an asterisk represents 
n-1 primer product of chemical DNA synthesis. The first telom erase specific TR A P -assay product is assigned at 12775 
Da. This product represents a 40-mer containing three telomeric repeats. Due to primer sequences this is the first 
expected amplification product of a positive TRAP-assay. The product is elongated by an additional nucleotide due to 

2s extendase activity of Taq DNA polymerase (calculated non-extended product: 1 2452 Da, by A extended product: 12765 
Da). The signal at 6389 Da represents the doubly charged ion of this product (calculated: 6387 Da). Figure 63 shows 
a section of higher masses of the same spectrum as depicted in figure 62, therefore the signal at 12775 Da is identical 
to that in figure 62. The TRAP-assay product containing seven telomeric repeats, representing a 64-mer also elongated 
by an additional nucleotide, is detected at 20322 Da (calculated: 20395 Da). The signals marked 1 , 2, 3 and 4 cannot 

30 be base-line resolved. This region includes of: 1 . signal of dimerlc n-1 primer, 2. second TRAP-assay amplification 
product, containing 4 telemeric repeats and therefore representing a 46-mer (calculated: 14341 Da/14674 Da for ex- 
tendase elongated product) and 3. dimerlc primer-ion and furthermore all their corresponding dep urination signals. 
There is a gap observed between the signals of the second and fifth extension product. This signal gap corresponds 
to the reduced band intensities observed in some cases for the third and fourth extension product in autoradiographic 

35 analysis of TRAP-assays (N.W. Kim et at. (1 994) Science 266:201 3). 

[0392] The above-mentioned problems, caused by the dimeric primer and related signals, can be overcome using 
an ultrafiltration step employing a molecular weight cut-off membrane for primer removal prior to MALDI-TOF -MS anal- 
ysis. This will permit an unambiguous assignment of the second amplification product. 

40 EXAMPLE 14 

A method for Detecting Neuroblastoma-Speclflc Nested RT-amp lined products Via MALDI-TOF Mass 
Spectrometry 

45 Introduction 

[0393] Neuroblastoma is predominantly a tumor of early childhood with 66% of the cases presenting in children 
younger than 5 years of age. The most common symptoms are those due to lumor mass, bone pain, or those caused 
by excessive catecholamine secretion. In rare cases, neuroblastoma can be identified prenatally (R.W. Jennings et al, 
50 (1 993) J. Ped. Surgery 28: 1 1 68- 1 1 74). Approximately 70% of all patients with neuroblastoma have metastatic disease 
at diagnosis. The prognosis is dependent on age at diagnosis, clinical stage and other parameters. 
[0394] For diagnostic purposes there is a high interest in reliable and sensitive methods of tumor cell detection, e. 
£ 1 , in control of autologous bone marrow transplants or on- going therapy. 

[0395] Since catecholamine synthesis Is a characteristic property of neuroblastoma cells and bone marrow cells lack 
55 this activity (H. Naito et al. , (1 991 ) Eur. J. Cancer 27 :762-765), neuroblastoma cells or metastasis in bone marrow can 
be Identified by detection of human tyrosine 3-hydroxylase (E.C. 1.14.16.2, hTH) which catalyzes the first step in 
biosynthesis of catecholamines. 

[0396] The expression of hTH can be detected via reverse transcription (RT) polymerase chain reaction (PCR) and 
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the amplified product can be analyzed via MALDI-TOF mass spectrometry. 

Materials and methods 

5 Cell- or tissue-treatment 

[0397] Cultures cells were pelleted (1 0 mln. 8000 rpm) and washed twice with PBS (137 mM IMaCI, 2.7 mM KCI, 4.3 
mM Na 2 HP0 4 *7H 2 0, 1 .4 mM KH 2 P0 4 in sterile PEPC water). The pellet was resuspended in 1 ml lysis/binding buffer 
(100 mM Tris-HCI, pH 8.0, 500 mM LiCI, 10 mM EDTA, 1 % Li-dodecyl sulfate, 5 mM DTT) until the solution becomes 
10 viscose. Viscosity was reduced by DNA-shear step using aim! syringe. The lysate maybe stored in -75° C or processed 
further directly. Solid tissues ( e.g. , patient samples) have to be homogenized before lysis. 

Preparation of magnetic Oiigo-oTT(25) beads 

15 [0398] 100 u.L beads per 1x10 s cells were separated from the storage buffer and washed twice with 200 u.L lysis/ 
binding buffer. 

Isolation of poly A*- RNA 

20 [0399] The cell lysate was added to the prepared beads and incubated for 5 min. at ambient temperature. The beads 
were separated magnetically for 2-5 min. and washed twice with 0.5 ml LDS (10 mM Tris-HCI, pH 8.0, 0.15 M LiCI, 1 
mM EDTA. 0.1%LiDS). 

Solid-phase first-strand cDNA synthesis 

25 

[0400] The poly A+RNA containing beads were resuspended in 20 jiL of reverse transcription mix (50 mM Tris-HCI, 
pH 8.3, 8 mM MgCI 2 , 30 mM KCI, 10 mM DTT, 1 .7 mM dNTPs, 3 U AMV reverse transcriptase) and incubated for 1 
hour at 45° C (with a resuspension step all ten min.). The beads were separated from the reverse transcription mix, 
resuspended in 50 u.L of elution buffer (2 mM EDTA pH 8.0) and heated to 95°C for 1 min. fur elution of the RNA. The 
30 beads with the cDNA first-strand can be stored in TB (0.089 M Tris-base, 0.089 M boric acid, 0.2 mM EDTA pH 8.0), 
TE 10 mM Tris-HCI, 0.1 mM EDTA, pH 8.0) or 70% ethanol for further processing. 

Nested polymerase chain reaction 

35 [0401] Beads containing cDNA first-strand were washed twice with 1x PCR buffer (20 mM Tris-HCI pH B.75, 1 0 mM 
KCI, 10 mM (NH 4 )2S0 4 , 2 mM MgS0 4 , 0.1% Triton X-100. 0.1 mg bovine serum albumin) and resuspended in PCR 
mix (containing 100) pmol of each outer primer, 2.5 u Pfu (exo-) DNA polymerase, 200 u.M of each dNTP and PCR 
buffer in a final volume of 50 u.L). The mixture was incubated at 72°C 1 min. and amplified by PCR for 30 cycles, for 
the nested reaction: 1 u,L of the first PCR was added as template to a PCR mix d(as above but nested primers instead 

40 of outer primers) and subjected to the following temperature program: 94°C 1 min., 65°C 1 mln. and 72°C 1 min. for 
20 cycles. 

Purification of nested amplified products 

45 [0402] Primers and low-molecular reaction by-products are removed using 10,000 Da cut-off ultrafiltration-unit. Ul- 
trafiltration was performed at 7.500 g for 25 minutes. For every PCR to be purified, 50 \iL Streptavidin M-280 Dynabeads 
(10 mg/ml) were washed twice with 1xBW buffer (5 mM Tris-HCI, pH 7.5, 0.5 mM EDTA, 1 M NaCI), added to the 
ultrafiltration membrane and incubated under gentle shaking for 15 mln. at ambient temperature. The supernatant was 
removed and the beads were washed twice with 1xBW buffer. 50 u,L 25% ammonium hydroxide were added to the 

so beads and incubated at ambient temperature for 10 mln. The supernatant was saved, the procedure repeated, both 
supernatants were pooied and 300 u.L ethanol (1 00%) were added. After 30 min. the DNA was pelleted at 1 3,000 rpm 
for 12 min., the pellet was air-dried and resuspended in 600 nl ultrapure water. 

MALDI- TOF MS of nested amplified products 

55 

[0403] 300 nl sample was mixed with 500 nl of saturated matrix-solution (3-HPA: ammonium citrate =10:1 molar 
ratio in 50% aqueous acetonrtrile), dried at ambient temperature and introduced into the mass spectrometer (vision 
2000, FInlgan MAT). AH spectra were collected in reflector mode using external calibration. 
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Outer primers: 
[0404] 

5 hTHl : d(TGT CAG AGC TGG ACA AGT GT SEQ ID NO. 47) 

hTH2: d(GAT ATT GTC TTC CCG GTA GC SEQ ID NO:48) Nested primers: 

blo-hTH d(bio-CTC GGA CCA GGT GTA CCG CC SEQ ID NO:49) ( mass:64B5 Da 

hTH6; d(CCT GTA CTG GAA GGC GAT CTC SEQ ID NO:50) t mass:6422 21 Da 

mass of biotinylated single strand amplified product: 19253:6 Da 
10 mass of nonblotlnylated single strand amplified product: 18758.2 Da 

Results 

[0405] A M AUDI -TO F mass spectrum of a human tyrosine 3-hydroxylase (hTH) specific nested amplified product 
(61 -mer) is depicted in figure 64. The signal at 1 8763 Da corresponds to non-biotinylated strand of the amplified product 
(calculated: 18758.2 Da, mass error: 0.02 Da). The signals below 10,000 and above 35,000 Da are due to multiply 
charged and dimeric amplified product-ions, respectively. 

[0406] The product was obtained from a solid phase cDNA derived in a reverse transcription reaction from 1x10 6 
cells of a neuroblastoma cell-line (L-A-N-1) as described above. The cDNA first-strand was subjected to a first PCR 
so using outer primers (hTH1 and hTH2), an aliquot of this PCR was used as template in a second PCR using nested 
primers (biohTH and hTH6). The nested amplified product was purified and MALDI-TOF MS analyzed: 
[0407] The spectrum In Fig. 64 demonstrates the possibility of neuroblastoma cell detection using nested RT-PCR 
and MALDI-TOF MS analysis. 

25 EXAMPLE 15 

Rapid Detection of the RET Proto-oncogene Codon 634 Mutation Using Mass Spectrometry 

Material and Methods 

30 

Probe 

[0408] The identity of codon 634 in each of the three alleles was confirmed by Rsal enzymatic digestion, single strand 
conformational polymorphism or Sanger sequencing. Exon 11 of the RET gene was PCR amplified (40 cycles) from 

35 genomic DNA using Taq-Polymerase (Boeh ringer-Mannheim) with 8 pmol each of S'-biotinylated forward (S'-blotin- 
CAT GAG GCA GAG CAT ACG CA-3' SEQ ID NO:51) and unmodified reverse <5'-GAC AGC AGC ACC GAG ACG 
AT-3' SEQ ID NO: 52) primer per tube; amplified products were purified using the Qiagen (QlAquick" kit to remove 
unincorporated primers. 15 u,l of amplified product were immobilized on 10 uX (10 mg/mL) Dynal streptavidin coated 
magnetic beads, denatured using the manufacturer's protocol, and the supernatant containing antisense strand dis- 

40 carded, the PROBE reaction was performed using thermoSequenase (TS) DNA Polymerase (Amersham) and Phar- 
macia dNTP/ddNTPs. 8 pmol of extension primer (5*-CGG CTG CGA TCA CCG TGC GG-3* SEQ ID NO: 53) was added 
to 13 uJ- H 2 0, 2u.L TS-buffer, 2u,L 2mM ddATP (or doTTP), and 2u.L of 0.5 mM dGT P/dCTP/dTTP (or dGTP/DCTP/ 
dATP), and the mixture heated for 30 sec @ 94*C, followed by 30 cycles of 10 sec @ 94 Q C and 45 sec @ 50°C; after 
a 5 min. incubation @ 95° C, the supernatant was decanted, and products were desalted by ethano! precipitation with 

<5 the addition of 0.5 u.L of 1 0mg/mL glycogen. The resulting pellet was washed in 70% ethanol, air dried, and suspended 
in 1 u.L H 2 0. 300 nL of this was mixed with the MALDI matrix (0.7 M 3-hydroxyplcollnic acid, 0.07 M ammonium citrate 
in 1:1 H 2 0:CH 3 CN) on a stainless steel sample probe and air dried. Mass, spectra were collected on a Thermo Bio- 
nalysis vision 2000 MALDI-TOF operated In reflectron mode with 5 and 20 kV on the target and conversion dynode, 
respectively. Experimental masses (m r (exp)) reported are those of the neutral molecules as measured using external 

so calibration. 

Direct Measurement of Diagnostic Products 

[04O9] PCR amplifications conditions for a 44 bp region containing codon 634 were the same as above but using 
« pfu polymerase; the forward primer contained a ribonucleotide at its 3'-terminus (forward, 5'-GAT CCA CTG TGC GAC 
GAG C (SEQ ID NO:54) -ribo; reverse, 5'-GCG GCT GCG ATC ACC GTG C (SEQ ID NO:55). After product immobi- 
lization and washing, 80 \iL of 1 2.5% NH 4 0H was added and heated at 80*C overnight to cleave te primer from 44-mer 
(sense strand) to give a 25-mer. Supernatant was pipetted off while still hot, dried resuspended In 50 u.L H 2 0, precip- 
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itated, resuspended, and measured by MALDI-TOF as above. MALDl-FTMS spectra of 25-mer synthetic analogs were 
collected as previously described (Li, Y. et aL, (1996) Ana!. Chem. 68:2090-2096); briefly, 1-10 pmol DNA was mixed 
1:1 with matrix on a direct insertion probe, admitted into the externa! ion source (positive ion mode), ionized upon 
irradiance with a 337 nm wavelength laser pulse, and transferred via rf-only quadruple rods into a 6.5 Tesla magnetic 
s field where they were trapped collisionally. After a 1 5 second delay, ions were excited by a broadband chirp pulse and 
detected using 256K data points, resulting in time domain signals of 5 s duration. Reported (neutral) masses are those 
of the most abundant isotope peak after subtracting the mass of the charge carrying proton (1 .01 Da). 

Results 

10 

[0410] The first scheme presented utilizes the PROBE reaction shown schematically in Figure 65. A 20-mer primer 
is designed to bind specifically to a region on the complementary template downstream of the mutation site; upon 
annealing to the template, which is labelled with biotin and immobilized to streptavidin coated magnetic beads, the 
PROBE primer is presented with a mixture of the three deoxynucleotide triphosphates (dNTPs), a di-dNTP (ddNTP), 
* 5 and a DNA polymerase (Figure 65). The primer Is extended by a series of bases specific to the identity of the variable 
base in codon 634; for any reaction mixture (e.g. , ddA+dT+dC+dG), three possible extension products representing 
the three alleles are possible (Figure 65). 

[041 1 J For the negative control (Figure 66), the PROBE reaction with ddATP+dNTPs (N = T, C, G) causes a M r (exp) 
shift of the primer from 6135 to 6726 Da (Am + 591 ). The absence of a peak at 6432 rules out a C->A mutation (Figure 

20 65); the mass of the single observed peak is more consistent with extension by C-ddA (M^calc) 6721 , +0.07% error) 
than by T-ddA (Mr(catc) 6736, -0.15% error) than of A 3 TC 2 G expected for C-»A mutant. Combining the ddA and ddT 
reaction data, it is clear that the negative control is as expected homozygous normal at codon 634. 
[0412] The ddA reaction for patient 1 also results in a single peak (M r (exp) = 6731) between expected values for 
wildtype and C-»T mutation (Figure 65b). The ddT reaction, however, results in two clearly resolved peaks consistent 

25 with a heterozygote wildtype (M^exp) 8249, +0.04% mass error)/C-*T mutant (M r (exp) 6428 Da, +0.08% mass error). 
For patient 2, the pair of Figure 66c ddA products represent a heterozygote C-»A (M^exp) 6431 , -0.06% mass error)/ 
normal (M r (exp) 671 9, -0.03% mass error) allele. The ddT reaction confirms this, with a single peak measured at 8264 
Da consistent with unresolved wildtype and C-»A alleles. The value of duplicate experiments is seen by comparing 
Figures 66a and 66b; while for patient 1 the peak at 6726 from the ddA reaction represents only one species, similar 

so peak from patient 1 is actually a pair of unresolved peaks differing in mass by 1 5 Da. 

[0413] An alternate scheme for point mutation detection is differentiation of alleles by direct measurement of diag- 
nostic product masses. A 44-mer containing the RET634 site was generated by the PCR, and the 1 9-mer sense primer 
removed by NH 4 OH cleavage at a ribonucleotide at its 3* terminus. 

[0414] Figure 67 shows a series of MALDl-FTMS spectra of synthetic analogs of short amplified products containing 
35 the RET634 mutant site. Figures 67a-c and 67d-f are homozygous and heterozygous genotypes, respectively. An 
internal calibration was done using the most abundant isotope peak for the wildtype allele; application of this (external) 
calibration to the five other spectra resulted in better than 20 ppm mass accuracy for each. Differentiation by mass 
aJone of the alleles is straightforward, even for heterozygote mixtures whose components differ by 16.00 (Figure 67d), 
2501 (Figure 67e), or 9.01 Da (Figure 65f). The value of high performance MS is clear when recognition of small DNA 
40 mass shifts is the basis for diagnosis of the presence or absence of a mutation. The recent reintroduction of delayed 
extraction (DE) techniques has Improved the performance of MALDI-TOF with shorts DNAs (Roskey, M.T. et aL, (1 996) 
Anal. Chem. 68 :941-946); a resolving power (RP) oMO 3 has been reported for a mixed-base 50-mer, and a pair of 
31 -mere with a C or a T (Am 15 Da) at a variable position resolved nearly to baseline. Thus DE-TOF-MS has demon- 
strated the RP required for separation of the individual components of hetero zygotes. Even with DE, however, the 
45 precision of DNA mass measurement with TOF Is typically 0.1 % (8 Da at 8 kDa) using external calibration, sufficiently 
high to result in incorrect diagnoses. Despite the possibility of space charge induced frequency shifts (Marshall, A.G. 
et a!. (1 991) Anal. Chem. 63:21 5A-229A), MALDl-FTMS mass errors are rarely as high as 0.005% (0.4 Da at 8 KDa), 
making internal calibration unnecessary. 

[0415] The methods for DNA point mutation presented here are not only applicable to the analysis of single base 
50 mutations, but also to less demanding detection of single or multiple base insertions or deletions, and quantification 
of tandem two, three, or four base repeats. The PROBE reaction yields products amenable to analysis by relatively 
low performance ESI or MALDI instrumentation; direct measurement of short amplified product masses Is an even 
more direct means of mutation detection.and will likely become more widespread with the Increasing interest in high 
performance MS available with FTMS. 

55 
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EXAMPLE 16 

Immobilization of nucleic acids on solid supports via an acld-lablle covalent blfunctlonal trltyl linker 

[0416] Amino linked DNA was prepared and purified according to standard methods. A portion (10eq) was evaporated 
to dryness on a speedvac and suspended in anhydrous DMF/pyrldlne (9:1 ; 0.1 ml). To this was added the chlorotrityl 
chloride resin (1 eq, 1 .05jimolAng loading) and the mixture was shaken for 24 hours. The loading was checked by 
taking a sample of the resin, detritylating this using 80% AcOH, and measuring the absorbance at 260nm. Loading 
was ca. 150prnol/mg resin. 

[041 7] tn 80% acetic acid, the halMife of cleavage was found to be substantially less than 5 minutes-this compares 
with trityl ether-based approaches of half-lives of 1 05 and 39 minutes for para and meta substituted blfunctlonal dimeth- 
oxytrltyl linkers respectively. Preliminary results have also indicated that the hydroxy picolinic acid matrix alone is 
sufficient to cleave the DNA from the chlorotrityl resin. 

EXAMPLE 17 

Immobilization of nucleic acids on solid supports via hydrophobic trltyl linker 

[0418] The primer contained a S'-dimethoxytrityl group attached using routine trityl-on DNA synthesis. 

[041 9] Ci8 beads from an oligo purification cartridge (0.2 mg) placed in a filter tip was washed with acetonitrile, then 

the solution of DNA (50 ng in 25 pi) was flushed through. This was then washed with 5% acetonitrUe in ammonium 

citrate buffer (70 mM, 250 u.l). To remove the DNA form the CI8, the beads were washed with 40% acetonitrile in water 

(10uJ) and concentrated to ca 2 u.l on the Speedvac. The sample was then submitted to MALDI. 

[0420] The results showed that acetonitrile/water at levels of ca.>30% are enough to dissociate the hydrophobic 

interaction. Since the matrix used in MALDI contains 50% acetonitrile, the DNA can be released from the support and 

successfully detected using MALDI-TOF MS (with the trityl group removed during the MALDI process). 

[0421 ] Figure 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers. 

EXAMPLE 18 

Immobilization of nucleic acids on solid supports via Streptavidln-lmlrtobiotin 

Experimental Procedure 

[0422] 2- iminobiotin N-hydroxy-succinim id ester (Sigma) was conjugated to the oligonucleotides with a 3'- or 5-'amino 
linker following the conditions suggested by the manufacturer The completion of the reaction was confirmed by MALDI- 
TOF MS analysts and the product was purified by reverse phase HPLC. 

[0423] For each reaction, 0. 1 mg of streptavidin-coated magnetic beads (Dynabeads M-280 Streptavidin from Dynal) 
were incubated with 80 pmol of the corresponding oligo in the presence of 1 M NaCI and 50 mM ammonium carbonate 
(pH 9.5) at room temperature for one hour. The beads bound with oligonucleotides were washed twice with 50 mM 
ammonium carbonate (pH 9.5). Then the beads were incubated in 2uJ of 3-HPA matrix at room temperature for 2 min. 
An aliquot of 0.5 pj of supernatant was applied to MALDI-TOF. For biotln displacement experiment, 1 .6. mol of free 
biotin (80-fold excess to the bound oligo) in 1 jil of 50 mM ammonium citrate was added to the beads. After a 5 min. 
incubation at room temperature, 1 uJ of 3-HPA matrix was added and 0.5 u.1 of supernatant was applied to MALDI-TOF 
MS. To maximize the recovery of the bound iminobiotin oligo, the beads from the above treatment were again incubated 
with a 2 u.1 of 3-HPA matrix and 0.5 uJ of supernatant was applied to MALDI-TOF MS. The matrix alone and free biotin 
treatment quantitatively released iminobiotin oligo off the streptavidin beads as shown in Figures 70 and 71. 

EXAMPLE 19 

Mutation Analysis Using Loop Primer Oligo Base Extension 
MATERIALS AND METHODS 

[0424] Genomic DNA. Genomic DNA was obtained from healthy individuals and patients suffering from sickle cell 
anemia. The wildtype and mutated sequences have been evaluated conventionally by standard Sanger sequencing. 
[0425] PCR-Amptincatkm. PCR amplifications of a part of the P globin was established and optimized to use the 
reaction product without a further purification step for capturing with streptavidin coated bead. The target amplification 
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for LOOP-PROBE reactions were performed with the loop-cod5 d(GAG TCA GGT GCG CCA TGC CTC AAA CAG 
ACA CCA TGG CGC, SEQ ID No. 58) as forward primer and 0-11 -bio d(TCT CTG TCT CCA CAT GCC CAG, SEQ 
ID. No. 59) as biotinyiated reverse primer. The underlined nucleotide in the loop-cod5 primer is mutated to introduce 
an invariant Cfol restriction site into the amplicon and the nucleotides in italics are complementary to a part of the 
amplified product. The total PCR volume was 50u,l including 200 ng genomic DNA, 1U Taq-polymerase (Boehringer- 
Mannheim, Cat# 1596594), 1.5 mM MgCI 2 . 0.2 mM dNTPs (Boehringer-Mannhelm, Ca# 1277049), and 10 pmol of 
each primer. A specific fragment of the p-globin gene was amplified using the following cycling condition: 5 min 94° C 
followed by 40 cycles of : 30 sec @ 94°C, 30 sec © 56°C, 30 sec @ 72°C, and a final extension of 2 min at 72°C. 
[0426] Capturing and denaturation of biotinyiated templates. 10uJ paramagnetic beads coated with streptavidin 
(lumo/ml; Dynal, Dynabeads M-280 streptavidin Cat# 1 1 2.06) and treated with 5x binding solution (5 M NH^CI, 0.3M 
NH 4 OH) were added to 40 u.1 PCR volume (1 OuJ of the amplified product was saved for check electrophoresis). After 
incubation for 30 min at 37°C the supernatant was discarded. The captured templates were denatured with 50 uJ 100 
mM NaOH for 5 min at ambient temperature, then washed once with 50 uJ 50 mM NH 4 OH and three times with 1 00 u.1 
10mM Tris.CI, pH 8.0. The single stranded DNA served as templates for PROBE reactions. 

[0427] Primer oligo base extension (PROBE) reaction. The PROBE reactions were performed using Sequenase 2.0 
(USB Cat# E70775Z including buffer) as enzyme and dNTPs and ddNTPs supplied by Boehringer-Mannheim (Cat# 
1277049 and 10083B2). The ratio between dNTPs (dCTP, dGTP, dTTP) and ddATP was 1 :1 and the total used con- 
centration was 50 u.M of each nucleotide. After addition of 5u.l 1-fold Sequenase-buffer the beads were incubated for 
5 min at 65°C and for 10 min at 37*C. During this time the partially self complementary primer annealed with the target 
site. The enzymatic reaction started after addition of 0.5 uJ 100 mM dithiothreitol (DTT), 3.5 ul dNTP/ddNTP solution, 
and 0.5 uJ Sequenase (0.8 U) and incubated at 37°C for 10 min. Hereafter, the beads were washed once in 1 -fold TE 
buffer (10 mM Tris, 1mM EDTA, pH 8.0). 

[0428] Cfol restriction digest The restriction enzyme digest was performed in a total volume of 5uJ using 10 U Cfol 
in 1-fold buffer L purchased from Boehringer-Mannheim. The incubation time was 20 min at 37°C. 

Conditioning of the diagnostic products for mass spectrometric analysis 

[0429] After the restriction digest, the supernatant was precipitated in 45 uJ H 2 0, 10 u.1 3M NH4- acetate (pH 6.5), 
0.5 u.1 glycogen (10 mg/ml in water, Sigma, Cat# G1 765), and 110u.l absolute ethanol for 1 hour at room temperature. 
After centrifugation at 13,000 g for 10 min the peilet was washed in 70% ethanol and resuspended in 2 uJ 18 MohnV 
cm H 2 0. The beads were washed in 100 u.1 0.7 M NH^ citrate followed by 100 u.1 0.05 M NH4. citrate. The diagnostic 
products were obtained by heating the beads in 2 uJ 50 mM NH 4 OH at 80*0 for 2 min. 

Sample preparation and analysis on MALDI-TOF mass spectrometry. 

[0430] Same preparation was performed by mixing 0.6 u.1 of matrix solution (0.7 M 3-hydroxypico!inlc acid, 0.07 M 
dibasic ammonium citrate in 1 :1 H 2 0:CH 3 CN) with 0.3 u.l of either resuspended DNA/glycogen pellet or supernatant 
after heating the beads in 50 mM NH 4 OH on a sample target and allowed to air dry. The sample target was automatically 
introduced in to the source region of an unmodified Perspective Voyager MALDI-TOF operated In delayed extraction 
linear mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical molecular mass (M r (calc)) 
were calculated from atomic compositions; reported experimental (M r (exp)) values are those of the singfy-protonated 
form. 

RESULTS 

[0431 ] The LOOP-PROBE has been applied to the detection of the most common mutation of codon 6 of the human 
p-globin gene leading to sickle cell anemia. The single steps of the method are schematically presented in figure 72. 
For the analysis of codon 6, a part of the p-globin gene was amplified by PCR using the biotinyiated reverse primer 
011 bio and the primer Ioop-cod5 which is modified to Introduce a Cfol recognition site (fig. 72a). The amplified product 
is 192 bp in length. After PCR the amplification product was bound to streptavidin coated paramagnetic particles as 
described above. The antisense strand was isolated by denaturation of the double stranded amplified product (Fig. 
72b). The intra-molecule annealing of the complementary 3* end was accomplished by a short heat denaturation step 
and incubation at 37*C. The 3' end of the antisense strand is now partially double stranded (Fig. 72c). For analyzing 
the DNA downstream of the self annealed 3*-end of the antisense strand, the primer oligo base extension (PROBE) 
has been performed using ddATP, dCTP, dGTP, dTTP (Fig. 72d). This generates different products in length specific 
for the genotype of the analyzed individual. Before the determination of the length of these diagnostic products, the 
DNA was incubated with the Cfol restriction endonuclease that cuts 5* of the extended product. This step frees the 
stem loop from the template DNA whereas the extended product still keeps attached to the template. The extended 
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products are then denatured by heating from the template stand and analyzed by MALDf-TOF mass spectrometry. 
[0432] Si nee the MALDI-TOF analyses were performed with a non -calibrated instrument, the mass deviation between 
observed and expected values was approximately 0.6% higher than theoretically calculated. Nevertheless, the results 
obtained were conclusive and reproducible within repeated experiments. In all analyzed supematants after the restric- 
tion digest the stem loop could be detected. Independent of the genotype, the stem loop has had in all analyses mo- 
lecular masses about 8150 Da (expected 8111 Da). An example is shown in Figure 73a. The second peak In this figure 
with a mass of 4076 Da is a doubly charged ion of the stem loop. Figure 73b to 73d show the analyses of different 
genotypes as indicated in the respective inserts. HbA is the wildtype genotype and HbC and HbS are two different 
mutations in codon 6 of the 0-globin gene which cause sickle cell disease. In the wildtype situation a single peak with 
a molecular mass of 4247 Da and another with 6696 Da are detected (Fig. 73b). The latter corresponds to the bioti- 
nylated PCR primer (0-11 -bio) unused In the PCR reaction which also has been removed in some experiments. The 
former corresponds to the diagnostic product for HbA. The analyses of the two individual DNA molecules with HbS 
trait as well as compound heterozygosity (HbS/HbC) for the sickle cell disorder lead also to unambiguous expected 
results (Fig. 73c and 73d). 

[0433] In conclusion, the LOOP-PROBE is a powerful means for detection of mutations especially predominant dis- 
ease causing mutations or common polymorphisms. The technique eliminates one specific reagent for mutation de- 
tection and, therefore, simplifies the process and makes it more amenable to automation. The specific extended product 
that is analyzed is cleaved off from the primer and is therefore shorter compared to the conventional method. In addition, 
the annealing efficiency is higher compared to annealing of an added primer and should therefore generate more 
product. The process is compatible with multiplexing and various detection schemes (e.g. , single base extension, oligo 
base extension and sequencing). For example, the extension of the loop-primer can be used for generation of short 
diagnostic sequencing ladders within highly polymorphic regions to perform, for example, HLA typing or resistance as 
well as species typing (e.g. , Mycobacterium tuberculosis)). 

EXAMPLE 20 

T7-RNA Polymerase Dependent Amplification of CKR-5 and Detection by MALDI-MS 
MATERIALS AND METHODS 

[0434] Genomic DNA. Human genomic DNA was obtained from healthy individuals. 

[0435] PCR-AmpliHcation and Purification. PCR amplification of a part of the CKR-5 gene was accomplished using 
ckrT7f as sense primer d(ACC TAG CGT TCA GTT CGA CTG AG A TAA TAC GAC TCA CTA TAG CAG CTC TCA 
TTTTCC ATA C (SEQ ID NO. 60). The underlined sequence corresponds to the sequence homologous to CKR-5, the 
bofded sequence corresponds to the T7-RNA polymerase promoter sequence and the Italic sequence was chosen 
randomly. ckrSr was used as antlsense primer d(AAC TAA GCC ATG TGC ACA ACA (SEQ ID NO. 61). Purification of 
the amplified product and removal of unincorporated nucleotides was carried out using the QIAquick purification kit 
(Qiagen, cat# 28104). In the final PCR volume of 50 u.l were 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer- 
Mannheim. cat# 1 596594), 1 .5 mM MgCI 2 0.2 mM dNTPs (Boehringer-Mannheim, cat# 1277049), and 1 0 pmol of each 
primer. The specific fragment of the CKR-5 gene was amplified using the following cycling conditions: 5 min & 94° C 
followed by 40 cycles of 45 sec @ 94°C, 45 sec 52°C, 5 sec @ 72 g C, and a final extension of 5 min at 72°C. 
[0436] T7-RNA Polymerase conditions. One third of the purified DNA (about 60ng) was used in the T7-RNA polymer- 
ase reaction. (Boehringer-Mannheim, cat# 881 767). The reaction was carried out for 2h at 37°C according to the 
manufacturer's conditions using the Included buffer. The final reaction volume was 20 u>l 0.7 uJ RNasin (33 U/u.l) had 
been added. After the extension reaction, the enzyme was inactivated by incubation for 5 min at 65°C. 

DNA digestion and conditioning of the diagnostic products for mass spec analysis. 

[0437] The template DNA was digested by adding RNase-free DNase I (Boehrlnger-Mannheimn, cat# 776 758) to 
the inactivated T7 mixture and incubation for 20 min at room temperature. Precipitation was carried out by adding 1 
u.l glycogen (10 mg/ml, Sigma, cat# G1765), 1/10 volume 3M NHg. acetate (pH 6.5), and 3 volume absolute ethanol 
and Incubation for 1 hour at room temperature. After centrifugation at 13,000 g for 10 min, the pellet was washed in 
70% ethanol and resuspended In 3 uJ 18 Mohm/cm H 2 0. 1 ui was analyzed on an agarose gel. 

Sample preparation and analysis on MALDI-TOF mass spectrometry 

[0438] Sample preparation was performed by mixing 0.6 u.l of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M 
dibasic ammonium citrate in 1 :1 H 2 0:CH 3 CN) with 0.3 ul of resuspended DNA/glycogen on a sample target and allowed 
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to air dry. The sample target was introduced into the source region of an unmodified Finnigan VISION2000 MALDI- 
TOF operated in relectron mode with 5kV. The theoretical molecular mass was calculated form atomic composition; 
reported experimental values are those of singry-pronated form. 

5 RESULTS 

[0439] The chemoklne receptor CKR-5 has been Identified as a major coreceptor in HIV-1 (see e.g., WO 96/39437 
to Human Genome Sciences; Cohen, J. et ai. Science 275 :1261). A mutant allele that is characterized by a 32 bp 
deletion is found in 16% of the HIV-1 seronegative population whereas the frequency of this allele is 35% lower in the 

10 HIV-1 seropositive population. It is assumed that individuals homozygous for this allele are resistant to HIV-1. The 
77- RNA polymerase dependent amplification was applied to identify this specific region of the chemoklne receptor 
CKR-5 (Figure 74). Human genomic DNA was amplified using conventional PCR. The sense primer has been modified 
so that it contains a random sequence of 24 bases that facilitate polymerase binding and the T7-RNA polymerase 
promoter sequence (Figure 75). The putative start of transcription is at the first base 5* of the promoter sequence. ckr5r 

15 was used as an antisense primer. PCR conditions are outlined above. The amplified product derived from wildtype 
alleles is 75 bp in length. Primer and nucleotides were separated from the amplification product using the Qiagen 
QIAquIck purification kit. One third of the purified product was applied to in vitro transcription with T7-RNA polymerase. 
To circumvent interference of the template DNA, it was digested by adding RNase-free DNase I. RNA was precipitated 
and this step also leaves the degraded DNA in the supernatant Part of the redissolved RNA was analyzed on an 

20 agarose gel and the rest of the sample was prepared for MALDI-TOF analysis. The expected calculated mass of the 
product is 24560 Da. A dominant peak, that corresponds to an approximate mass of 25378.5 Da can be observed. 
Since the peak is very broad, an accurate determination of molecular mass was not possible. The peak does not 
correspond to residual DNA template. First, the template DNA is digested, and second, the DNA strands would have 
a mass of 23036.0 and 23174 Da, respectively 

25 [0440] This example shows that T7 RNA polymerase can effectively amplify target DNA. The generated RNA can 
be detected by Mass spectrometry. In conjunction with modified (e.g. , 3 i -deoxy)ribo nucleotides that are specifically 
incorporated by a RNA polymerase but not extended any further, this method can be applied to determine the sequence 
of a template DNA. 

30 EXAMPLE 21 

MALDI Mass Spectrometry of RNA Endonuclease Digests 

MATERIALS 

35 

[0441 J Synthetic RNA (Sample A:5'- UCCGGUCUGAUGAGUCCGUGAGGAC-S 1 (SEQ ID 62); sample B:5'-GUCAC- 
UACAGGUGAGCUCCA-3' (SEQ ID NO 63); sample C^'-CCAUGCGAGAGUAAGUAGUA-S' (SEQ ID NO. 64)) sam- 
ples were obtained from DNA technology (Aahus, Denmark) and purified on a denaturing polyacrylamide gel (Shaler, 
T. A. era/. (1996) Anal. Chem. 63 :5766-579). Rnases T n (Eurogentec), U 2 (Calblochem), A (Boehrlnger-Mannheim) 
■« and PhyM (Pharmacia) were used without additional purification. Streptavidin-coated magnetic beads (Dynabeads M- 
280 Streptavidin, Dynal) were supplied as a suspension of 6-7 x 10 s bead/ml (10 mg/ml) dissolved in phosphate- 
buffered saline (PBS) containing 0.1% BSA and 0.02% NaN 3 . 3-Hydroxypicolinic acid (3-HPA) (Aldrich) was purified 
by a separate desalting step before use as described in more detail elsewhere (Little, D. P. et a/. (1 995) Proc. Natl. 
Acad. Sci. U.S.A. 92 . 23 1 8-2322). 

45 ~ 

METHODS 

[0442] In vitro transcription reaction. The 5'-biotinylated 49 nt in vitro transcript (SEQ ID No. 65): 

50 

AGGCCUGCGGCAAGACGGAAAGACCAUGGUCCCUNAUCUGCCGCAGGAUC 

was produced by transcription of the plasmid pUTMS2 (linearized with the restriction enzyme BamHI) with T7 RNA 
polymerase (Promega). For the transcription reaction 3 u.g template DNA and 50u T7 RNA polymerase were used in 
ss a 50 |il volume of 1 u/u.l RNA guard (Rnax inhibitor, Pharmacia), 0.5 mM NTP's 1 .0 mM 5*-biotin-ApG dinucleotide, 40 
mM Tris-HCl (pH 8.0), 6 mM MgClj, 2 mM spermidine and 10 mM DTT Incubation was performed at 37°C for 1 hour, 
then another aliquot of 50 units T7 RNA polymerase was added and incubation was continued for another hour. The 
mixture was adjusted to 2M NH^ acetate and the RNA was precipitated by addition of one volume of ethanol and one 
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volume of isopropanol. The precipitated RNA was collected by centrlfugation at 20,000 X g for 90 min at 4°C, the pellet 
was washed with 70% ethanol, dried and redissolved at 8 M urea. Further purification was achieved by electrophoresis 
through a denaturing polyacrylamide gel as described elsewhere <Shaler, T. A. etal (1 996) Anal. Chem. 68:576-579). 
The ration of 5'-biotiny!ated to non-biotinylated transcripts was about 3:1 . 

[0443] Ribonucfease assay. For partial digestion with selected RNases different enzyme concentrations ad assay 
conditions were employed as summarized in table VII. The solvents for each enzyme were selected following the 
suppliers' instructions. The concentrations of the synthetic RNA samples and the fn vitro transcript were adjusted to 
5-10 x lO^M. 
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[0444] The reaction was stopped at selected times by mixing 0.6 jil aliquots of the assay with 1 .5 \i\ of 3 HPA-sofution . 
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The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysts. 

[0445] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 |xl> of 25% ammonium hydroxide and 
RNA sample (5-10 x 10 6 M) at 60°C. 1 pi aliquots were taken out at selected times and dried in a stream of cold air. 
For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1 .5pl of the matrix 

5 solution and 0.7 pi of NH 4 + loaded cation exchanged polymer beads were added. 

[0446] The reaction was stopped at selected times by mixing 0.6 pi aliquots of the assay with 1 .5 pi of 3HPA-solutlon. 
The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis. 
[0447] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 pi) of 26% ammonium hydroxide and 
RNA sample (5-10 x lOr 6 M) at 60*0. 1 pJ aliquots were taken out at selected times and dried in a stream of cold air. 

io For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1 .5 pi of the matrix 
solution and 0.7 pi if a suspension of NH 4 + loaded cation exchange polymer beads were added. 
[0448] Separation of 5-bkrtinyiated fragments. Steptavidin-coated magnetic beads were utilized to separate 5*-bioti- 
nylated fragments of the in vitro transcript after partial RNase degradation. The biotin moiety in this sample was intro- 
duced during the transcription reaction initiated by the 5'-biotln-pApG-dinucieotlde. Prior to use, the beads were washed 

15 twice with 2 x binding & washing (b&w) buffer (20 mM Tris-HCI, 2 mM EDTA, 2 M NaCI pH 8.2) and resuspended at 
1 0 mg/ml in 2 x b&w buffer. Circa 25 pmol of the RNA in vitro transcript were digested by RNase U2 using the protocol 
described above. The digestion was stopped by adding 3 uJ of 95% formamide containing 10 mM trans- 1 ,2-diamino- 
cyclohexane-N,N,N\N 1 -tetraacetic acid (CDTA) at 90°C for 5 min, followed by cooling on ice. Subsequently, capture 
of the biotinylated fragments was achieved by incubation of 6pJ of the digest with 6 pi of the bead suspension and 3pl 

20 of b&w buffer at room temperature for 15 min. Given the binding capacity of the beads of 200 pmoi of biotinylated 
oligonucleotide per mg of beads, as specified by the manufacturer, the almost 2-times excess of oligonucleotide was 
used to assure a full loading of the beads. The supernatant was removed, and the beads were washed twice with 6uJ 
of H z O. The CDTA and 95% formamide at 90°C for 5 min. After evaporation of the solvent and the formamide the £2.5 
pmol of fragments were resuspended In 2 pi H 2 0 and analyzed by MALDI-MS as described above. 

25 [0449] Sample preparation for MALDi-MS. 3-Hydroxypicolinic acid (3-HPA) was dissolved In ultra pure water to a 
concentration of ca. 300 mM. Metal cations were exchanged against NH 4 4 as described in detail previously. (Little, D. 
P. etat. (1995) Proc. Natl. Acad. Set. U.S.A. 92: 2318-2322). Aliquots of 0.6 pi of the analyte solution were mixed with 
1 .5 pi 3-HPA on a flat inert metal substrate. Remaining alkali cations, present in the sample solution as well as on the 
substrate surface, were removed by the addition of 0.7 pi of the solution of NH 4 + - loaded cation exchange polymer 

30 beads. During solvent evaporation, the beads accumulated in the center of the preparation, were not used for the 
analysis, and were easily removed with a pipette tip. 

[0450] instrument A prototype of the Vision 2000 (ThermBioanalysis, Hemel, Hempstead, UK) reflectron time of 
flight mass spectrometer was used for the mass spectrometry. Ions were generated by irradiation with a frequency- 
tripled ND:YAG laser (355 nm, 5 ns; Spektrum GmbH, Berlin, Germany) and accelerated to 10 ke V. Delayed ion 

35 extraction was used for the acquisition of the spectra shown, as it was found to substantially enhance the signal to 
noise ratio and/or signal intensity. The equivalent flight path length of the system is 1.7 m, the base pressure is 10 4 " 
Pa. Ions were detected with a discrete dy node secondary- electron multiplier (R2362, Hamamatsu Photonics), equipped 
with a conversion dynode for effective detection of high mass ions. The total impact energy of the ions on the conversion 
dynode was adjusted to values ranging from 16 to 25 keV, depending on the mass to be detected. The preamplified 

40 output signal of the SEM was digitized by a LeCroy 9450 transient recorder (LeCroy, Chestnut Ridge, NY, USA) with 
a sampling rate of up to 400 MHz. For storage and further evaluation, the data were transferred to a personal computer 
equipped with custom-made software (ULISSES). Ail spectra shown were taken in the positive Ion mode. Between 20 
and 30 single shot spectra were averaged for each of the spectra shown. 

45 RESULTS 

[0451] Specificity of Rnases. Combining base-specific RNA cleavage with MALDI-MS requires reaction conditions 
optimized to retain the activity and specificity of the selected enzymes on the one hand and complying with the boundary 
conditions for MALDI on the other. Incompatibility mainly results because the alkaline-Ion buffers, commonly used in 

so the described reaction, such as Na-phosphate, Na-citrate or Na-acetate as well as EDTA interfere with the MALDI 
sample preparation; presumably they disturb the matrix crystallization and/or analyte incorporation. Tris-HCI or am- 
monium salt buffers, in contrast, are MALDI compatible (Shaler, T. A. era/. (1 996) Anal. Chem. 68 :576-579). Moreover, 
alkaline salts in the sample lead to the formation of a heterogenous mixture of multiple salts of the analyte, a problem 
increasing with increasing number of phosphate groups. Such mixtures result in loss of mass resolution and accuracy 

55 as well as signal-to- noise ratio (Little, D. P. et ai. (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322; Nordhoff, E., 
Cramer, R. Karas, M.,Hillenkamp, R, Ktrpekar, F., Kristiansen, K. and Roepstorff, P. (1993) Nudeh Acids Res., 21, 
3347-3357). Therefore, RNase digestions were carried out under somewhat modified conditions compared to the ones 
described in the literature. They are summarized above In table VII. For Rnase T 1f A, CL 3 ad Cusatrvin, Tris-HCI (pH 
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6-7.5) was used as buffer. 20 mM OAC provides the pH of 5, recommended for maximum activity of RNases U 2 and 
PhyM. The concentration of 10-20 mM of these compounds were found to not interfere significantly with the MALDI 
analysis. To examine the specificity of the selected ribonucieases under these conditions, three synthetic 20-25mer 
RNA molecules with different nucleotide sequences were digested. 

[0452] The MALDI-MS spectra of Figure 77 shows five different cleavage patterns (A-E) of a 25 nt RNA obtained 
after partial digestion with RNases T^ U 2 PhyM, A, and alkaline hydrolysis. These spectra were taken from aliquots 
which were removed from the assay after empirically determined incubation times, chosen to get an optimum coverage 
of the sequence. As the resulting samples were not fractionated prior to mass spectrometry analysis, they contain all 
fragments generated at that time by the respective RNases. In practice, uniformity of the cleavages, can be affected 
by a preferential attack on the specific phophodiester bonds (Donis-Keller, H., Maxam, A. M. p and Gilbert, W. (1977) 
Nucleic Acids Res., 4, 1957-1978; Donis-kelter, H. (1980) Nucleic Acids Res., 8 3133-3142). The majority of the ex- 
pected fragments are indeed observed in the spectra. It is also worth noting that for the reaction protocols as used, 
correct assignment of all fragment masses is only possible, if a 2\ 3'-cyclic phosphate group is assumed. It is well 
known that such cyclic phosphates are intermediates in the cleavage reaction and get hydrolyzed in a second, inde- 
pendent and slower reaction step involving the enzyme (Richards, F. M., and Wycoff, H. W. In The Enzymes Vol. 4, 
3rd Ed., (ed. Boyer, P.D.) 746-806 (1971 , Academic Press, New York); Heinemann, U and W. Saenger (1985) Pure 
Appl Chem. 57, 41 7-422; Ikehara, M. et at., (1 987) Pure Appt, Chem. 59-965-968) Vreslow, R. and Xu, R. (1 993) Proc. 
Nat. Acad. Sci. USA, 90, 1201-1207). In a few cases different fragments have equal mass of differ by as little as 1 
Daiton., in these cases, mass peaks cannot unambiguously be assigned to one or the other fragments. Digestion of 
two additional different 20 nt RNA samples was, therefore, performed (Hahner, S., Kirpekar, R, Nordoff, E., Krlstiansen, 
K., Roepstorff , P. and Hillenkamp. F. (1 996) Proceedings of the 44th ASMS Conference on Mass Spectrometry, Port- 
land, Oregon) in order to sort out these ambiguities. For alt samples tested, the selected ribonucieases appear to 
cleave exclusively at the specified nucleotides leading to fragments arising from single as well as multiple cleavages. 
[0453] In Figure 77, peaks, indicating fragments containing the original 5'-terminus, are marked by arrows. All non 
marked peaks can be assigned to internal sequences or those with retained 3'-tenninus. For a complete sequence all 
possible fragments bearing exclusively either the 5 - or the 3'-terminus of the original RNA would suffice. In practice, 
the 5'-fragments are better suited for this purpose, because the spectra obtained after incubation of ail three synthetic 
RNA sampfes contain the nearly complete set of originals of 5'-ions for all different RNases (Hahner, S., Kirpekar, F., 
Nordoff, E., Kristiansen, K. f Roepstorff and Hillenkamp, F. (1 996) Proceedings of the 44th ASMS Conference on Mass 
Spectrometry, Portland, Oregon), internal fragments are somewhat less abundant and fragments containing the original 
3'-terminus appear suppressed In the spectra. In agreement with observations reported in the literature (Gupta, R. C. 
and Randerath, K. (1977) Nucleic Acids Res., 4, 1957-1978), cleavages close to the S'-terminus were partially sup- 
pressed in partial digests of the RNA 25 mer by RNase T, and U 2 (even if they are internal or contain the original 5'- 
terminus). Fragments from such cleavages appear as weak and poorly resolved signals in the mass spectra. 
[0454] For larger RNA molecules secondary structure is known to influence the uniformity of the enzymatic cleavages 
(Donis-Keller, H., Maxam, A. M. and Gilbert, A. (1977) Nucleic Acids Res. 8, 3133-3142). This can, in principle be, 
overcome by altered reaction conditions. In assay solutions containing 5-7 M urea, the activity of RNases such as T 2 , 
U 2 . A, Clg, and PhyM is known to be retained (Donis-Keller, H., Maxam, A. M. and Gilbert. W. (1977) Nucleic Acids 
Res., 4, 2527-2537; Boguskl, M. S., Hieter, P.A., and Levy, C. C. (1980) J. Biol. Chem.,255 t 2160-2163; Donis-Keller, 
H. (1980) Nucleic Acids Res., 8, 3133-3142, while RNA is sufficiently denatured. UV-MALDI-analysis with 3-HPA as 
matrix Is not possible under such high concentrations of urea in the sample. Up to a concentration of 2 M urea in the 
reaction buffer, MALDI analysis of the samples was still possible although significant changes in matrix crystallization 
were observed. Spectra of the RNA 20 mer (sample B), digested In the presence of 2 M urea still resembled those 
obtained under conditions listed in Table VII. 

[0455] Digestion by RNases which exclusively recognize one nucleobase is desirable to reduce the complexity of 
the fragment patterns and thereby facilitate the mapping of the respective nucleobase. RNases CL3 and cursavitin are 
enzymes reported to cleave at cytidylic acid residues. Upon limited RNase CL3 and cursath/in digestion of the RNA- 
20mer (sample B) under non-denaturing conditions, fragments corresponding to cleavages at cytidylic residues were 
indeed observed (Figure 78). Similar to the data reported so far (BoguskJ, M. S., Hieter, P. A. and Levy, C. C. (1980) 
J. Biol. Cfrem.,255,2160-2163: Rojo, M. A., Arias, F. J. r Iglesias, R., Ferreras, J. M., Munoz, R„ Escarmis, C, Soriano, 
F., Llopez-Fando, Mendez, E., and Girbes, T. (1994) Ptanta, 194, 328-338). The degradation pattern in Figure 78, 
however, reveals that not every cytidine residue is recognized, especially for neighboring C residues. RNase CL 3 is 
also reported to be susceptible to the influence of secondary structure (Boguskl, M. S., Hefter, P. A., and Levy, C. C. 
(19B0) J Biol. Chem. f 255, 2160-2163), but for RNA of the size employed In this study, such an influence should be 
negligible. Therefore, unrecognized cleavage sites in this case can be attributed to a lack of specificity of this enzyme. 
To confirm these data, a further RNase CL 3 -digestion was performed with the RNA 20mer (sample C). As a result of 
the sequence of this analyte, all three linkages containing cytidylic acid were readily hydrolyzed, but additional cleav- 
ages at uridyJic acid residues were detected as well. Since altered reaction conditions such as Increased temperature 
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(90°C), various enzyme to substrate ratios, and addition of 2M urea did not result in a digestion of the expected spe- 
cificity, application of this enzyme to sequencing was not pursued further. Introduction of a new cytidine-specific ribo- 
nuclease, cusatlvin, isolated form dry seeds of Cucumis sativus L looked promising for RNA sequencing (Rojo, M. A., 
Arias, F. J., Iglesias, FT, Ferreras, J. M., Munoz, R., Escarmis, C, Soriano, F., Llopez-Fando, J., Mendez. E. and Girbes, 
T. (1994) Plants, 194, 328-338). As shown in Figure 78, not every cytidine residue was hydrolyzed and additional 
cleavages occurred at urldyiic acid residues for the recommended concentration of the enzyme. RNases CUj and 
cusativin will, therefore not yield the desired sequence information for mapping of cytidine residues and their use was 
not further pursued. The distinction of pyrimidine residues can be achieved, however, by use of RNases with multiple 
specificities, such as Physarvm pofycephafum RNase (cleaves ApN, UpN) and pancreatic RNase A (cleaves UpN, 
CpN) (see Figure 77). All 5'-terminus fragments, generated by the monospecific RNase U 2 and apparent in the spectrum 
of Figure 77C were also evident in the spectrum of the RNase PhyM digest (Figure 77D). Five of the six uridilic cleavage 
sites could, this way, be uniquely identified by this indirect method. In a next step, the knowledge of the uridine cleavage 
sites was used to identify sites of cleavage of cytidiiic acid residues in the spectrum recorded after incubation with 
RNase A (Figure 77E), again using exclusively ions containing the original S'-terminus. Two of the four expected cleav- 
age sites were identified this way. A few Imitations are apparent from these spectra, if only the fragments containing 
the original B'-terminus are used for the sequence determination. The first two nucleotides usually escape the analysis, 
because their signals get lost in the low mass matrix background. Because of this, the corresponding fragments are 
missing in the spectra of the U- and C-specific cleavages. Large fragments with cleavage sites close to the 3'-termlnus 
are often difficult to identify, particularly in digests with RNases T 1 > and U 2 , because of their low yield (vide supra) and 
the often strong nearby signal of the non-digested transcript Accordingly the cleavages in position 22 and 23 do not 
show up in the spectrum of the G -specific RNase T, (Figure 77A) and the cleavage site 24 cannot be identified from 
the spectra of the U 2 and PhyM digests (Figures 77 C and D). Also site 1 6 and 1 7 with two neighboring cytidiiic acids 
cannot be identified in the RNase A spectrum of Figure 77E. These observations demonstrate that a determination of 
exclusively the 5'-terminus fragments may not always suffice and the Information contained in the interna! fragments 
may be needed for a full sequence analysis. 

[0456] Finalfy, limited alkaline hydrolysis provides a continuum of fragments (Figure 77B), which can be used to 
complete the sequence data. Again, the spectrum is dominated by ions of fragments containing the 5*-terminus, al- 
though the hydrolysis should be equal for all phosphodiester bonds. As was true for the enzymatic digests, correct 
mass assignments requires one to assume that all fragments have a 2', 3'-cyclic phosphate. The distribution of peaks, 
therefore, resembles that obtained after a 3'-exonuclease digest (Pieles, U M Zurcher, W., Schar, M. and Moser, H. E., 
(1 993) Nucleic Acids Res., 21 , 31 91 -31 96; Nordhoff , E. et §L (1 993) Book of Abstracts, 1 3* Internal Mass Spectrom. 
Conf., Budapest p. 218; Kirpekar, R, Nordhoff, E., Kristiansen, K., Roepstorff, P., Lezius, A. Hahner, S., Karas, M. and 
Hillenkamp. F. (1994) Nucleic Acid Res., 22, 3866-3870). In principle, the alkaline hydrolysis alone could, therefore, 
be used for a complete sequencing. This is, however, only possible for quite small oligoribonucleotides, because larger 
fragment ions, differing in mass by only a few mass units will not be resolved in the spectra and the mass of larger 
ions cannot be determined with the necessary accuracy of better than 1 Da, even if peaks are partially or fully resolved. 
The interpretation of the spectra particularly from digests of unknown RNA samples is substantially simplified, If only 
the fragments containing the original 5'-terminus are separated out prior to the mass spectrometry analysis. A proce- 
dure for this approach is described in the following section. 

[0457] Separation of S'-biotinylated fragments. Streptavidin-coated magnetic beads (Dynal) were tested for the ex- 
traction of fragments containing the original S'-terminus from the digests. Major features to be checked for this solid- 
phase approach are the selective immobilization and efficient elutton of biotinyiated species. In preliminary experiments, 
a 5*-biotinylated DNA (1 9 nt) and streptavidin were incubated and MALDI analyzed after standard preparation. Despite 
the high affinity of the streptavidln-blotln interaction, the intact complex was not found In the MALDI spectra. Instead, 
signals of the monomerfc subunit of streptavidin and the biotinyiated DNA were detected. Whether the complex dis- 
sociates in the acidic matrix solution (pKA 3) or during the MALDI desorption process, is not known. Surprisingly, if the 
streptavidin is immobilized on a solid surface such as magnetic beads, the same results are not observed. A mixture 
of two 5'- biotinyiated DNA samples (1 9 nt and 27 nt) and two unlabeled DNA sequences (1 2 nt and 22 nt) were incubated 
with the beads. The beads were extracted and carefully washed before incubation in the 3-HPA MALDI matrix. No 
analyte signals coukJ be obtained from these samples. To test whether the biotinyiated species had been bound to the 
beads altogether, elutlon form the extracted and washed beads was performed by heating at 90°C In the presence of 
95% formamide. This procedure is expected to denature the streptavidin , thereby breaking the streptavidin/biotin com- 
plex. Figure 798 shows the expected signals of the two biotinyiated species, proving that release of the bound molecules 
in the MALDI process is the problem rather than the binding of the beads; Figure 79A shows a spectrum of the same 
sample after standard preparation, showing signals of all four analytes as a reference. Complete removal of the for- 
mamide after the elution and prior to the mass spectrometric analysis was found to be important, otherwise crystalli- 
zation of the matrix is disturbed. Mass resolution and the slgnal-to-nolse ration in spectrum 79B are comparable to 
those of the reference spectrum. These results testify to the specificity of the streptavldin-biotin Interaction, since no 
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or only minor signals of the non-biotlnylatod analyte were detected after incubation with the Dynal beads. Increased 
suppression of nonspecific binding was reported through an addition of the detergent Tween-20 to the binding buffer 
(Tong, X. and Smith, L. M. (1 992) Anaf Chem., 64, 2672-2677). Although this effect could be confirmed in this study, 
peak broadening affected the quality of the spectra due to remaining amounts of the detergent. The necessity of an 
elution step as a prerequisite for detection of the captured biotinylated species can be attributed to a stabilizing effect 
of the complex by the immobilization of the streptavidin to the magnetic beads. 

[0458] For practical application of this solid phase method to sequencing a maximum efficiency of binding and elution 
of biotinylated species is of prime importance. Among a variety of conditions investigated so far, addition of salts such 
as EDTA gave best results in the case of DNA sequencing by providing ionic strength to the buffer (Tong, X. and Smith, 
L. M. (1992) Ana/Chem. t B4 t 2672-2677). To examine such an effect on the solid-phase method, several salt additives 
were tested for the binding and elution of the 5 '-biotinylated RNA in vitro transcript (49 nt). The results are shown in 
Figure 80. Judging from the relative intensity, signal-to-noise ration, and resolution of the respective signals, a 95% 
formamide solution containing 10 mM CDTA (Figure BOD) is most efficient for the binding/el ution. Since CDTA acts as 
a chelating agent for divalent cation, formation of proper secondary an tertiary structure of the RNA is prevented. An 
improved sensitivity and spectral resolution has been demonstrated under such conditions for the anafysis of RNA 
samples by electrospray mass spectrometry (Limbach, P. A., Grain, P. F. and McCloskey, J. A. (1 995) J Am. Soc. Mass. 
Spectrom., 6, 27-39). The improvement in the MALDI analysts is actually not very significant compared to the spectrum 
obtained for the solution containing formamide alone (Figure 81b), but the reproducibility for spectra of good quality 
was substantially improved for the CDTA/formamide solution. Thus in addition to the improved binding/elution, this 
additive may also improve the incorporation of the analyte into the matrix crystals. Unfortunately, a striking signal 
broadening on the high mass side was observed in case of formamide solutions containing EDTA, CDTA or 25% 
ammonium hydroxide. Since this effect is most prominent in case of 25% ammonium hydroxide and this agent was 
also used for adjusting EDTA and CDTA to their optimum pH, a pronounced NH 3 adduct ion formation ca be assumed. 
[0459] The applicability of streptavi din-coated magnetic beads separation to RNA sequencing was demonstrated for 
the Rnase U 2 digest of the5'-biotinylated RNA in wf/otranscript (49 nt) (Figure 81 ). The entire fragment pattern obtained 
after incubation with Rnase U 2 is shown is spectrum 81 A. Separation of the biotinylated fragments reduces the com- 
plexity of the spectrum (Figure 81 B) since only S'-terminal fragments are captured by the beads. The signals in the 
spectrum are broadened and the increased number of signals in the low mass range Indicate that even after stringent 
washing of the beads, some amounts of buffer and detergent used for the binding and elution remained. Further im- 
provements of the method are, therefore, needed. Another possible strategy for application of the magnetic beads is 
the immobilization of the target RNA prior to RNase digestion by an elution of the remaining fragments for further 
analysis. Cleavage of the RNA was impeded in this case, as evidenced by a prolonged reaction time for the digest 
under otherwise identical reaction conditions. 

EXAMPLE 22 

Parallel DNA Sequencing Mutation Analysis and Mlcrosatellrte Analysis Using Primers wfth Tags and Mass 
Spectrometrfc Detection 

[0460] This EXAMPLE describes specific capturing of DNA products generated in DNA analysis. The capturing is 
mediated by a specific tag (5 to 8 nucleotides long) at the 5' end of the analysis product that binds to a complementary 
sequence. The capture sequence can be provided by a partially double stranded oligonucleotide bound to a solid 
support. Different DNA analysis (e.g., sequencing, mutation, diagnostic, microsatellite analysis) can be carried out in 
parallel, using, for example, a conventional tube or microtiter plate (MTP). The products are then specifically captured 
and sorted out via the complementary identification sequence on the tag oligonucleotide. The capture oligonucleotide 
can be bound onto a solid support (e.g., silicon chip) by a chemical or biological bond. Identification of the sample is 
provided by the predefined position of the capute oligonucleotide. Purification, conditioning and analysis by mass spec- 
trometry are done on solid support. This method was applied for capturing specific primers that had a 6 base tag 
sequence. 

MATERIALS AND METHODS 

Genomic DNA. 

[0461] Genomic DNA was obtained from healthy individuals. 
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PCR Amplification 

[0462] PCR amplifications of part of the 0-globin gene were established using 02 d(CATTTGCTTCTGACACAACT 
Seq. ID. No. 66) as forward primer and p1 1 d(TCTCTGTCTCCACATGCCCAG Seq. ID. No. 67) as reverse primer. The 
total PCR volume was 50 u.l including 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer-Mannheim, Cat# 1 59594), 
1 .5 mM MgC^, 0.2 mM dNTPs (Boehringer-Mannheim, Cat# 1277049), and 1 0 pmol of each primer. A specific fragment 
of the 0-globin gene was amplified using the following cycling conditions: 5 min @ 94 P C followed by 40 cycles of 30 
sec <& 94'C, 45 sec @ 53°C, 30 sec @ 72°C, and a final extension of 2 min @ 72°C. Purification of the amplified 
product and removal of unincorporated nucleotides was carried out using the QIAquick purification kit (Qiagen, Cat 
28104). One fifth of the purified product was used for the primer oligo base extension (PROBE) or sequencing reactions, 
respectively. 

Primer oligo base extension (PROBE) and sequencing reactions 

[0463] Detection of putative mutations in the human 0-globin gene at codon 5 and 6 and at codon 30 and in the I VS- 
1 donor site, respectively, was done in parallel (FIGURE 82A). p-TAG1 (GTCGTCCCATGGTGCACCTGACTC Seq. 
ID. No. 68) served as primer to analyze codon 5 and 6 and 0-TAG2 (CGCTGTGGTGAGGCCCTGGGCA Seq. ID. No. 
69) for the analyses of codon 30 and the IVS-t donor site. The primer oligo base extension (PROBE) reaction was 
done by cycling, using the following conditions: final reaction volume was 20 u.l, 0-TAG1 primer (5 pmol), 0-TAG2 primer 
(5 pmol), dCTP, dGTP, dTTP, (final concentration each 25 uM), ddATP (final concentration 1 00 uM) dNTPs and ddNTPs 
purchased from Boeringer-Mannheim, Cat# 1277049 and 1008382), 2 u.1 of lOx ThermoSequence buffer and 2.5 U 
ThermoSequenase (Amersham, CAT#E79000Y), The cycling program was as follows: 5 min @ 94*C, 30 sec @ 53°C, 
30 sec @ 72°C and a final extension step for 8 min @ 72°C. Sequencing was performed under the same conditions 
except that the reaction volume was 25 ul and the concentration of nucleotides was 250 uM for ddNTP. 

Capturing using TAG sequence and sample preparation 

[0464] The capture oligonucleotides cap-tag1 d(GACGACGACTGCTACCTGACTCCA Seq ID No. 70) and cap-tag2 
d(ACAGCGGACTGCTACCTGACTCCA Seq ID No. 71), respectively, were annealed to equimolar amounts of uni-as 
d(T G GAGTCAGGTAGCAGTC Seq ID No. 72) (FIGURE 82A). Each oligonucleotide had a concentration of 10 pmol/ 
U.I in ddH 2 0 and incubated for 2 min @ 80°C and 5 min @ 37°C. This solution was stored at -20°C and aliquots were 
taken. 10 pmol annealed capture oligo nucletides were bound to 10 u.l paramagnetic beads coated with streptavldin 
(10 mg/m!; Dynal, Dynabeads M-280 streptavldin Cat# 1 12.06) by incubation for 30 min <& 37°C. Beads were captured 
and the PROBE or sequencing reaction, respectively, was added to the capture oligonucleotides. To facilitate binding 
of 0-TAG1 abd 0-TAG2, respectively, the reaction was incubated for 5 min © 25° C and for 30 min @ 1 6»C. The beads 
were washed twice with ice cold 0.7 M NH 4 Citrate to wash away unspecific bound extension products and primers. 
The bound products were dissolved by adding 1 u.l DDH z O and incubation for 2 min @ 65*C and cooling on ice. 0.3u.l 
of the sample were mixed with 0.3u,l matrix solution (saturated 3-hydroxypicolinic acid, 1 0% molar ratio ammonium- 
citrate in acetonitrile/water (50/50. v/v)) and allowed to air dry. The sample target was automatically introduced into 
the source region of an unmodified Perspective Voyager MALDI-TOF operated in deJayed extraction linear mode with 
5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (M r (calc)) were 
calculated from atomic compositions; reported experimental M^(M r (exp)) values are those of the singly-pronated form. 

RESULTS 

[0465] Specific capturing of a mixture of extension products by a short complementary sequence has been applied 
to isolate sequencing and primer oligo base extension (PROBE) products. This method was used for the detection of 
putative mutations in the human p-globin gene at codon 5 and 6 and at codon 30 and IVS-1 donor site, respectively 
(FIGURE 82A). Genomic DNA has been amplified using the primers 02 and pll . The amplification product was purified 
and the nucleotides separated. One fifth of the purified product was used for analyses by primer oligo base extension. 
To analyze both sites In a single reaction, primers, p-TAG1 and p-TAG2, were used respectively. 0-TAG1 binds upstream 
of codons 5 and 6 and 0-TAG2 upstream of codon 30 and the IVS-1 donor site. Extension of these primers was per- 
formed by cycling in the presence of ddATP and dCTP, dGTP and d.TTP, leading to specific products, depending on 
the phenotype of the individual. The reactions were then mixed with the capture oligonucleotides. Capture oligonucle- 
otides include the biotinytated capture primer cap-tag 1 and cap-tag2, respectively. They have 6 bases at the 5' end, 
that are complementary to the 5' end of p-TAG1 and 0-TAG2, respectively. Therefore, they specifically capture these 
primers and the extended products. By annealing a universal oligonucleotide (uni-as) to the capture oligonucleotide, 
the capture primer fe transformed into a partially double stranded molecule where only the capture sequence stays 
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single stranded (Figure 82). This molecule is then bound to streptavidin coated paramagnetic particles, to which the 
PROBE or sequencing reaction, respectively is added. The mixture was washed to bind only the specifically annealed 
oligonucleotides. Captured oligonucleotides are dissolved and analyzed by mass spectrometry. 
[0466] PROBE products of one individual (Fig. 83) show a small peak with a molecular mass of 7282.8 Da. This 
corresponds to the unextended 0-TAG1 that has a calculated mass of 7287.8 Da. The peak at 8498.6 Da corresponds 
to a product, that has been extended by 4 bases. This corresponds to the wildtype situation. The calculated mass of 
this product Is 8500.6 Da. There is no significant peak indicating a heterozygote situation. Furthermore only p-TAG1 
and not (J-TAG2 has been captured, indicating a high specificity of this method. 

[0467] Analyses of what was bound to cap-tag2 (Figure 84) shows only one predominant peak with a molecular 
mass of 9331,5 Da. This corresponds to an extension of 8 nucleotides. It Indicates a homozygous wildtype situation 
where the calculated mass of the expected product fs 9355 Da. There is no significant amount of unextended primer 
and only p-TAG2 has been captured. 

[0468] To prove that this approach is also suitable for capturing specific sequencing products, the same two primers 
p-TAG1 and 0-TAG2, respectively, were used. The primers were mixed, used in one sequencing reaction and then 
sorted by applying the above explained method. Two different termination reactions using ddATP and ddCTP were 
performed with these primers (Figures 85 and 86, respectively). All observed peaks in the spectrograms correspond 
to the calculated masses in a wildtype situation. 

[0469] As shown above, parallel analysis of different mutations (e.g. , different PROBE primers) is now possible. 
Further, the described method is suitable for capturing specific sequencing products. Capturing can be used for sep- 
aration of different sequencing primers out of one reaction tube/well, isolation of specific multiplex-amplified products, 
PROBE products, etc. Conventional methods, like cycle sequencing, and conventional volumes can be used. A uni- 
versal chip design permits the use of many different applications. Further, this method can be automated for high 
throughput. 

EXAMPLE 23 

Deletion Detection by Mass-Spectrometry 

[0470] Various formats can be employed for mass spectrometer detection of a deletion within a gene. For example, 
molecular mass of a double standard amplified product can be determined, or either or both of the strands of a double 
stranded product can be isolated and the mass measured as described in previous examples. 
(0471) Alternatively, as described herein, a specific enzymatic reaction can be performed and the mass of the cor- 
responding product can be determined by mass spectrometry. The deletion size can be up to several tens of bases in 
length, still allowing the simultaneous detection of the wildtype and mutated allele. By simultaneous detection of the 
specific products, it is possible to identify in a single reaction whether the individual is homozygous or heterozygous 
for a specific allele or mutation. 

MATERIALS AND METHODS 

Genomic DNA 

[0472] Leukocyte genomic DNA was obtained from unrelated healthy individuals. 
PCR amplification 

[0473] PCR amplification of the target DNA was established and optimized to use the reaction products without a 
further purification step for capturing with streptavidin coated beads. The primers for target amplification and for PROBE 
reactions were as follows: 

[0474J CKRA-F:d(CAG CTC TCA TTT TCC ATA C SEQ ID. NO. 73) and CKRA-R bio: d(AGC CCC AAG ATG ACT 
ATC SEQ ID. NO. 74). CKR-5 was amplified by the following program: 2 mln @ 94*C, 45 seconds @ 52°C, 5 seconds 
@ 72°C, and a final extension of 5 minutes at 72°C. The final volume was 50 p.1 including 200 ng genomic DNA 1 U 
Taq-polymerase (Boehrlnger-Mannheim, Cat # 1596594), 1 .5 Mm MgCfe, 0.2 Mm DNTPS (Boehringer-Mannheim, Cat 
# 1277049), 10 pmol of unmodified forward primers, and 8 pmol 5' blotlnylated reverse primer. 

Capturing and Denaturation of Biotinyfated Templates 

[0475] 10 uJ paramagnetic beads coated with streptavidin (10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat # 
1 1 2.06) in 5x binding solution (5M NH 4 CI , 0.3 M NH 4 OH) were added to 45 u.l PCR reaction (5 u.l of PCR reaction were 



77 



BP 1 164 203 A2 

saved for electrophoresis). After binding by incubation for 30 min. at 37°C the supernatant was discarded. Captured 
templates were denatured with 50 u.l of 1 00 Mm NaOH for 5 min. at ambient temperature, washed once with 50 jlI 50 
Mm NH 4 OH and three times with 100 U.I 10 Mm Tris/Cl, Ph 8.0. The single stranded DNA served as templates for 
PROBE reactions. 

Primer Oligo Base Extension (PROBE) Reaction 

[0476] The PROBE reaction was performed using Sequence 2.0 (USB Cat # E70775Z including buffer). dATP/DGTP 
and ddTTP were supplied by Boehringer-Mannheim (Cat # 1277049 and 1008382). d(CAG CTC TCATTTTCC ATA 
C (SEQ ID. NO. 73) was used as PROBE primer (Figure 87). The following solutions were added tot he beads: 3.0 uJ 
H 2 0, 1-0 reaction buffer. 1 .0 uJ PROBE primer (1 0 pmol) and incubated at 65°C for 5 minutes followed by 37°C for 
10 min. Then 0.5 uJ DTT, 3.5 uJ DNTPS/ddntp each 50 uJvl and 0.5 uJ Sequenase (0.8 U) were added and incubated 
at37°Cfor 10 min. 

T4 Treatment of DNA 

[0477J To generate blunt ended DNA, amplification products were treated with T4 DNA polymerase (Boehringer- 
Mannheim Cat# 1 004786). The reactions were carried out according to the manufacturer's protocol for 20 min. at 11 °C. 

Direct Size Determination of Extended Products 

[0478] To determine the size of the amplified product, MALDI-TOF was applied to one strand of the amplification 
product, samples were bound to beads, as described above, conditioned and denatured, as described below. 

DAW Conditioning 

[0479] After the PROBE reaction the supernatant was discarded nd the beads were washed first In 50 u.t 700 mM 
NH 4 -citrate and second 50 u.l 50 mM NH 4 -citrate. The generated diagnostic products were removed for the template 
by heating the beads in 2 uJ H s O at 80*C for 2 min. The supernatant was used for MALDI-TOF analysis. 

Sample Preparation and Analysis with MALDI-TOF Mass Spectrometry 

[0480] Sample preparation was performed by mixing 0.6 uJ of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M 
dibasic citrate in 1:1 h^OrCHaCN) with 0.3 u.1 of diagnostic PROBE products In water on a sample target and allowed 
to air dry. Up to 1 00 samples were spotted on a probe target disk for introduction into the source region of an unmodified 
Perspective Voyager MALDI-TOF Instrument operated in linear mode with delayed extraction and 5 and 30 kV on the 
target and conversion dynode, respectively. Theoretical average molecular mass (M,(calc)) of analytes were calculated 
from atomic compositions, reported experimental M r (M r (exp)) values are those of the singly-pronated form, determined 
using internal calibration with unextended primers in the case of PROBE reactions. 

Conventional Analyses 

[0481] Conventional analyses were performed by native polyacryfamlde gel electrophoresis according to standard 
protocols. The diagnostic products were denatured with formamide prior to loading onto the gels and stained with 
ethidfum bromide or silver, respectively. 

RESULTS 



[0482] The CKR-5 status of 1 0 randomly chosen DNA samples of healthy individuals were anaiyzed. Leukocyte DNA 
was amplified by PCR and an aliquot of the amplified product was analyzed by standard polyacrylamide gel electro- 
phoresis and silver staining of the DNA (Figure 88). Four samples showed two bands presumably indicating hetero- 
zygosity for CKR-5, whereas the other 6 samples showed one band, corresponding to a homozygous gene (Figure 
88). In the case where two bands were observed, they correspond to the expected size of 75 bp for the wildtype gene 
and 43 bp for the allele with the deletion (Figure 87). Where one band was observed, the size was about 75 bp which 
indicated a homozygous wildtype CKR-5 allele. One DNA sample derived from a presumably heterozygous one from 
a homozygous individual were used for all further analysis. To determine the molecular mass of the amplified product, 
DNA was subjected to matrix assisted laser desorption/ionization coupled with time of flight analysis (MALDI-TOF)! 
Double stranded DNA, bound to streptavldin coated paramagnetic particles, was denatured and the strand released 



78 



EP 1 164 203 A2 

into the supernatant was analyzed. Figure 89A shows a spectrograph of a DNA sample, that was supposed to be 
heterozygous according to the result derived by potyacrylamide gel electrophoresis (Figure 88). The calculated mass 
of the sense strand for a wildtype gene is 23036 Da and for the sense strand carrying the deletion allele 131 43 (Figure 
87 and Table VI). Since many thermostable polymerases unspecifically add an adenosine to the 3' end of the product, 
those masses were also calculated. They are 23349 and 13456 Da. The masses of the observed peaks (Figure 89A) 
are 23119 Da, which corresponds to the calculated mass of a wildtype DNA strand where an adenosine has been 
added (23349 Da). Since no peak with a mass of about 23036 Da was observed, the polymerase must have qualitatively 
added adenosine. Two peaks, which are close to each other, have a mass of 13451 and 1 3137 Da. This corresponds 
to the calculated masses of the allele, with the 32bp deletion. The higher mass peak corresponds to the product, where 
adenosine has been added and the lower mass peak to the one without the un specific adenosine. Both peaks have 
about the same height, Indicating that to about half of the product adenosine has been added. The peak with a mass 
of 11682 Da is a doubly charged molecule of the DNA corresponding to 23319 Da (2 x 11682 Da = 23364 Da), The 
peaks with masses of 6732 and 6575 Da are doubly charged molecules of the one with masses of 13451 and 13137 
Da and the peak with 7794 Da corresponds to the triply charged molecule of 2331 9Da. Multiple charged molecules 
are routinely identified by calculation. Amplified DNA derived f mm a homozygous individual shows In the spectrograph 
(Figure 89C) one peak with a mass 23349.6 and a much smaller peak with a mass of 23039.9 Da. The higher mass 
peak corresponds to DNA resulting from a wildtype allele with an added adenosine, that has a calculated mass of 
23349 Da. The lower mass peak corresponds to the same product without adenosine. Three further peaks with a mass 
of 11686, 7804.6 and 5852.5 Da correspond to doubly, triply and quadruply charged molecules. 
[0483] The unspecific added adenine can be removed from the amplified DNA by treatment of the DNA and T4 DNA 
polymerase. DNA derived from a heterozygous and a homozygous individual was analyzed after T4 DNA polymerase 
treatment. Figure 89B shows the spectrograph derived from heterozygous DNA. The peak corresponding to the wildtype 
strand has a mass of 23008 Da indicating that the added adenine had been removed completely. The same is observed 
for the strand with a mass of 13140 Da. 

The other three peaks are multiply charged molecules of the parent peaks. The mass spectrograph for the homozygous 
DNA shows one peak that has a mass of 23004 Da, corresponding to the wildtype DNA strand without an extra adenine 
added. All other peaks are derived from muitiply charged molecules of this DNA. The amplified products can be analyzed 
by direct determination of their masses, as described above, or by measuring the masses of products, that are derived 
from the amplified product in a further reaction. In this "primer oligo base extension (PROBE)" reaction, a primer that 
can be internal, as it is in the nested PCR, or identical to one of the PCR primers, is extended for just a few bases 
before the termination nucleotide is incorporated. Depending on the extension length, the genotype can be specified. 
CKRA-F was used as a PROBE primer, and dATP/dGTP and ddTTP as nucleotides. The primer extension is AGT in 
case of a wildtype template and AT In case of the deletion (Figure 87). The corresponding masses are 6604 Da for the 
wildtype and 6275 Da for the deletion, respectively. PROBE was applied to two standard ON As. The spectrograph 
(Figure 90A) shows peaks with masses of 6604 Da corresponding to the wildtype DNA and at 6275 Da corresponding 
to the CKR-5 deletion allele (Table VIII). The peak at a mass of 5673 Da corresponds to CKRA-F (calculated mass of 
5674 Da). Further samples were analyzed in analogous way (Figure 90B). It Is unambiguously identified as homozygous 
DNA, since the peak with a mass of 6607 Da corresponds to the wildtype allele and the peak with a mass of 5677 Da 
to the un extended primer. No further peaks were observed. 

[0484) The example demonstrates that deletion analysis can be performed by mass spectrometry. As shown herein, 
the deletion can be analyzed by direct detection of single stranded amplified products, or by analysis of specifically 
generated diagntlc products (PROBE). In addition, as shown in the following Example 26, double stranded DNA am- 
plified products can be analyzed. 



J Size 


Calculated Mass 


{ 1 j 
Measured Mass a 


I wildtype w/o A 


23036 


23039/23009/23004 


wildtype with A 


23349 


23319/23350 


deletion w/o A 


13143 


13137/13139 


deletion with A 


13456 


13451 


PROBE 






wildtype 


6604 


6604/6608 | 


deletion 


6275 


6275 | 



All masses are In Dalton. 
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EXAMPLE 24 



Pentaptex tc-PROBE 

SUMMARY 

[0485] The multiplexing of thermocycting primer ollgo base extension (tc-PROBE) was performed using five poly- 
morphic sites in three different apolipoprotein genes, which are thought to be involved in the pathogenesis of athero- 
sclerosis. The apo lipoprotein A IV gene (codons 347 and 360), the apolipoproteln E gene (codons 112 and 158), and 
the apo lipoprotein B gene (codon 3500) were examined. AIJ mass spectra were easy to interpret with respect to the 
five polymorphic sites. 

MATERIALS AND METHODS 

PCR Amplification 

[0486] Human leukocytic genomic DNA was used for PCR. Listed below are the primers used for the separated 
amplification of portions of the Apo A IV, Apo E and the Apo B genes: 



Apo A IV: A347F: 5'-CGA GGA GCT CAA GGC CAG AAT-3' 

(SEQ ID NO. 75) 

A360 R-2-bio: # 5'-CAG GGG CAG CTC AGC TCT C-3' 

(SEQ IO NO. 76) 

Apo E; ApoE-F: 5'-GGC ACG GCT GTC CAA GGA-3' 

(SEQ ID NO. 77) 

ApoE-R bio: # 5'-AGG CCG CGC TCG GCG CCC TC-3' 

(SEQ ID NO. 78) 

Apo B: ApoB-F2 bio: # 5'-CTT ACT TGA ATT CCA AGA GC-3' 

(SEQ ID NO. 79) 

Apo B-R: 5'-GGG CTG ACT TGC ATG GAC CGG A-3' 

(SEQ ID NO. 80) 

* biotinyfated 



[0487] Taq polymerase and 10x buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs for Phar- 
macia (Freiburg, Germany). The total PCR reaction volume was 50 u.l including 1 0 pmol of each primer and 1 0% DMSO 
(dimethylsulfoxide, Sigma) (no DMSO for the PCR of the Apo B gene), with -200 mg of genomic DNA used as template 
and a final dNTP concentration of 200 pM. Solutions were heated to 80°C before the addition of 1 U Taq polymerase; 
PCR conditions were: 5 min at 95*0, followed by 2 cycles 30 sec 94°C, 30 sec 62 a C, 30 sec 72°C, 2 cycles 30 sec 
94*C 30 sec 58*0, 30 sec 72°C, 35 cycles of 30 sec at 94*C, 30 sec at 56*C, 30 sec at 72°C, and a final extension 
time of 2 min at 72°C. To remove unincorporated primers and nucleotides, amplified products were purified using the 
"QiAqutek" (Qiagen, Germany )kit, with elution of the purified products in 50jiL of TE buffer (10 mM Tris-HCI, 1mM 
EDTA, pH 8.0). 

Binding of the amplified product on beads 

[0488] 1 0 u.l of each purified amplified product was bound to 5uJ DynaBeads (Dynal, M -280 Streptavidin) and dena- 
tured according to the protocol from Dynal. For the pentaplex tc-PROBE reaction the three different amplified product 
(bound on the beads) were pooled. 

Tc-PROBE 

[0489] For the PROBE reaction the following primers were used: 



80 



EP 1 164 203 A2 

(Apo A) P347: 5'-AGC CAG GAC AAG-3* (SEQ ID NO. 81) 
(Apo A) P360: 5'-ACA GCA GGA ACA GCA-3' (SEQ ID NO. 82) 
(Apo E) P112: 5'-GCG GAC ATG GAG GAC GTG-3* (SEQ ID NO. 83) 
(Apo E) P158: S'-GAT GCC GAT GAC CTG CAG AAG-3'(SEQ ID NO. 84) 
(Apo B) P3500: 5'GTG CCC TGC AGC TTC ACT GAA GAC-3»(SEQ ID NO. 85) 

[0490] The tc-PROBE was carried out In a final volume of 25 |xl containing 1 0 pmol of each primer listed above, 2.5 
U Thermoquenase (Amersham), 2.5 pL Thermoquenase buffer, and 50 pAI dTTP (final concentrations) and 200 jiM 
of ddA/C/GTP, respectively. Tubes containing the mixture were placed in a thermocycler and subjected to the following 
cycling conditions: dematu ration (94 W C) the supernatant was carefuiiy removed from the beads and 'desalted" by ethanol 
precipitation to exchange nonvolatile cations such as Na+ and K+ with NH 4 +, which evaporated during the ionization 
process; 5 pL 3M ammonium acetate (pH 6.5) 0.5 u,L glycogen (1 0 mg/mL, Sigma), 25 u,L H 2 0, and 11 0 pL absolute 
ethanol were added to 25 pL PROBE supernatant and incubated for 1 hour at 4*C. After a 10 min. centrifugation at 
13,000 X g, the pellet was washed in 70% ethanol and resuspended in 1 pL 18 MohnVcm H 2 0. A 0.35 pL aliquot of 
resuspended DNA was mixed with 0.35 pL matrix solution (0.7 M 3-bydroxypicotinlc acid (3-HPA), 0.07 M ammonium 
citrate in 1 :1 H 2 0:CH 3 CN) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition 
using the Thermo Bioanalysls Version 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target 
and conversion dynode, respectively, Theoretical average molecular masses (M 1 (calc)) of the fragments were calcu- 
lated from atomic compositions. External calibration generated from synthetic (ATCG) n oligonucleotide (3.6-1 8kDa) 
was used. Positive ion spectra from 1-37500 Da were coliected. 

RESULTS 

[0491] Table VIII shows the calculated molecular masses of all possible extension products including the mass of 
the primer itself. Fig. 91 shows a respective MALDI-TOP MS spectra of a tc-PROBE using three different templates 
and 5 different PROBE primers simultaneously in ne reaction. Comparison of the observed and calculated masses 
(see table VIII) allows a fast genetic profiling of various polymorphic sites in an individual DNA sample. The sample 
presented In Figure 91 Is homozygous for threonine and glutamfne at position 347 and 360, respectively, in the apol- 
ipoprotein A IV gene, bears the epsiion 3 allele homozygous in the apolipoprotein E gene, and is also homozygous at 
the codon 3500 for arginine In the apolipoprotein B gene. 



TABLE VUI 





[ SEQ ID | mass | allele 


Apolipoprotein A IV 


5'-AGCCAGGACAAG-3' (347) 


86 


3688.40 


unextended 
primer 


5'-AGCCAGGACAAGTC-3' 


87 


4265.80 


347Ser 


5 ' - AGCC AGG AC AAG A-3' 


88 


3985.60 


347Thr 


5'-ACAGCACCAACAGCA~3'{360) 


89 


4604.00 


unextended 
primer | 


| 5'-ACAGCAGGAACAGCATC-3' 


90 


5181.40 


360His ] 
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10 



15 



20 



25 





SEQ ID 


mass 


allele | 


5'-ACAGCAGGAACAGCAG-3' (112) 


91 


4917.20 


360Gln B 


Apoltpoprotein E | 


5'-GCGGACATGGAGGACGTG-3' (112) 


92 


5629.60 


unextended fl 
primer | 


5'-GCGGACATGGAGGACGTGGC-3' 


93 


6247.00 


1 1 2Cy s g 


5'~GCGGACATGGAGGACGTGC~3' 


94 


5902.80 


112Arg I 


5'-GATGCCG ATGACCTGCAG AAG-3'( 1 58) 


95 


6480.20 


unextended R 
primer | 


5'-GATGCCGATGACCTGCAGAAGC-3' 


96 


6753.40 


158Arg | 


5'-GATGCCGATGACCTGCAGAAGTG-3' | 


97 


7097.60 


158Cys 


I Apolipoprotein B-lOO 


5'-GTGCCCTGCAGCTTCACTGAAGAC-3' 
I (3500) 


98 


7313.80 


unextended 
primer 


I 5' GTGCCCTGCAGCTTCACTGAAGACTG-3' 


99 


7931.20 


3500Gln 


j 5'-GTGCCCTGCAGCTTCACTGAAGACC-3' 


100 


7587.00 


35O0Arg 
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45 



50 
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Sequencing Exons 5 to 8 of the p53 Gene by MALDI-TOF Mass Spectrometry 

MATERIALS & METHODS 

[0492] Thirty-five cycles of PCR reactions were performed in a 96 well microliter plate with each well containing a 
total volume of 50 u.l including 200 ng genomic DNA. 1 unit Taq DNA polymerase, 1 .5 mM Mg C1 2 , 0.2mM dNTPx, 1 0 
pmol of the forward primer and 6 or 8 of the biotinylated reverse primer. The sequences of PCR primers prepared 
according to established chemistry (N.D. Sinha, J. Biemat, H. Kter, Tetrahed. Lett. 24:5843-5846 (1 983) are as follows: 
exon 5: d(bioti n-TATCTGTTCACTTGTGCCC SEQ ID NO. 101) and d(biotin-CAGAGGCCTGGGGACCCTG SEQ ID 
NO. 102); exon 6: D(ACGACAGGGCTGGTTGCC SEQ ID NO. 103) and d(biotin-ACTGACAACCACCCTTAAC SEQ 
ID NO. 1 04); exon 7: d(CTGCTTGCCACAGGTCTC SEQ ID NO. 105) and d(bk>tln-CACAGCAGGCCAGTGTGC SEQ 
ID NO. 106; exon 8: d(GGACCTGATTTCCTTACTG SEQ ID N0.107) and d(bk>tin-TGAATCTGAGGCATAACTG SEQ 
ID NO. 108), 

[0493] To each well of the 96-well microliter plate containing unpurified amplified product, 0.1 mg of paramagnetic 
streptavidin beads (Dynal) In 1 0 uJ of 5 x binding solution (5 M NH 4 OH) was added and incubated at 3 7»C for 30 min 
Then beads were treated with 0.1 M NaOH at room temperature for 5 min followed by one wash with 50 mM NH 4 OH 
at room temperature for 5 min followed by one wash with 50 mM Tris-HCI. 

[0494J Four dideoxy termination reactions were carried out in separate wells of the microliter plate. A total of 84 
reactions (21 primers x4 reactions/primer) can be performed In a single mfcroliter plate. To each well containing im- 
mobilized single-stranded template, a total volute of 10 |xl reaction mixture was added including 1x reaction buffer 10 
pmol of sequencing primer, 250 mM of dNTPs, 25 mM of one of the ddNTPs, and 1 -2 units of Thermosequenase 
(Amersham). Sequencing reactions were carried out on a thermal cycler using non-cycling conditions: 80°C 1 min 
50°C, 1 min, 50*C to 72°C, ramping 0.1 Vsec, and 72*C, 5 min. The beads were then washed with 0.7 M ammonium 
citrate followed by 0.05 M ammonium citrate. Sequencing products were then removed from beads by heating the 
beads to 80°C in 2 uJ of 50mM NH 4 OH for 2 min. The supernatant was used for MALDI-TOF MS analysis. 
[0495J Matrix was prepared as described in Kter. et a! (Kter, H. et a/., Nature Biotechnol. 14; 1123-1128 (1996)) This 
saturated matrix solution was then diluted 1 .52 times with pure water before use. 0.3 uJ of the diluted matrix solution 
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was then diluted 1 .52 times with pure water before use. 0.3 p. I of the diluted matrix solution was loaded onto the sample 
target and allowed to crystallize followed by addition of 0.3 u.f of the aqueous analyte. A Perseptive Voyager DE mass 
spectrometer was used for the experiments, and the samples were typically analyzed in the manual mode. The target 
and middle plate were kept at + 18.2 kV for 200 nanoseconds after each laser shot and then the garget voltage was 
raised to + 20kV. the ion guide wire in the flight tube was kept at - 2V. Normally, 250 laser shots were accumulated foe 
each sample. The original spectrum was acquired under 500 MHz digitizing rate, and the final spectrum was smoothed 
by a 455 point average (Savitsky and GoJay, (1964) Analytical Chemistry, 36:1627). Default caJibration of the mass 
spectrometer was used to identify each peak and assign sequences. The theoretical mass values of two sequencing 
peaks were used to recalibrate each spectrum. (D.R Little, T.J. Cornish, M.J. O'Donnel, A. Braun, R.J. Cotter, H. Kter, 
Anal. Chem., submitted). 

RESULTS 

[0496] Alterations of the p53 gene are considered to be a critical step in the development of many human cancers 
(Greenblatt, et aL. (1994) Cancer Res. 54, 4855-4878; C.C. Harris, (1996) J. Cancer, 73, 261-269; and D. Sidransky 
and M. Hollstein, (1996) Annu.Res.Med., 47,285-301). Mutations may serve as molecular indicators of clonality or as 
early markers of relapse in a patient with a previously identified mutation in a primary tumor (Hainaut, et a!.. (1997) 
Nucleic Acid Res., 25, 151-157). The prognosis of the cancer may differ according to the nature of the p53 mutations 
present (H.S. Goh et a!.. (1995) Cancer Res, 55, 5217-5221). Since the discovery of the p53 gene, more than 6000 
different mutations have been detected. Exons 5-8 were selected as sequencing targets where most of the mutations 
cluster (Hainaut et aL (1997) Nucleic Acids Res., 25, 151-7). 

[0497] Figure 96 schematically depicts the single tube process for target amplification and sequencing, which was 
performed, as described in detail in the Materials and Methods. Each of exon 5-8 of the p53 gene was PGR amplified 
using flanking primers in the intron region; the down stream primer was biotinylated. Amplifications of different exons 
were optimized to use the same cycling profile, and the products were used without further purification. PCR reactions 
were performed in a 96 well microliter plate and the product generated in one well was used as the template for one 
sequencing reaction. Streptavidin-coated magnetic beads were added to the same microliter plate and amplified prod- 
ucts were immobilized. The beads were then treated with NaOH to generate immobilized single-stranded DNA as 
sequencing template. The beads were washed extensively with Tris buffer since remaining base wouJd reduce the 
activity of sequencing enzyme. 

[0498] A total of 21 primers were selected to sequence exon 5-8 of the p53 gene by primer walking. The 3'-end 
nucleotide of all the primers is located at the site where no known mutation exists. Four termination reactions were 
performed separately which resulted in a total of 84 sequencing reactions on the same PCR microliter plate. Non- 
cycling conditions were adopted for sequencing since streptavidin coated beads do not tolerate the repeated application 
of high temperature. Sequencing reactions were designed so that mt terminated fragments were under 70 nucleotides, 
a size range easily accessible by MALDI-TOF MS and yet long enough to sequence through the next primer binding 
site. Thermequenase was the enzyme of choice since it could reproducible generate a high yield of sequencing products 
in the desired mass range. After the sequencing reactions, the beads were washed with ammonium ion buffers to 
replace all other cations. The sequencing ladders were then removed from the beads by heating in ammonium hydrox- 
ide solution or simply in water. 

[0499] A sub-microliter aliquot of each of the 84 sequencing reactions was loaded onto one MS sample holder con- 
taining preloaded matrix. Figure 94 gives an example of sequencing data generated from one primer; four spectra are 
superimposed. 

[0500] All sequencing peaks were well resolved in the mass range needed to read through the next sequencing 
primer site. Sometimes doubly charged peaks were observed which could be easily identified by correlating the mass 
to that of the singly charged Ion. False stops generated by early termination of the enzymatic extension can be observed 
cle to the primer site. Since the mass resolution is high enough, it Is easy to differentiate the false stop peaks from the 
real sequencing peaks by calculating the mass difference of the neighboring peaks and crs comparing the four spectra. 
Additionally, mt primers generated detectable data through the region of the downstream primer binding site thereby 
covering the false stop region. 

[0501] Using optimized procedures of amplification, sequencing, and conditioning, exons 5-8 of the p53 gene were 
successfully sequenced. Correct wlldtype sequence data were obtained from all exons with a mass resolution about 
300 to BOO over the entire mass range. The overall mass accuracy is 0.05% or better. The average amount of each 
sequencing fragment loaded on the MS sample holder is estimated to be 50 fmol or less. 

[0502] This example demonstrates the feasibility of sequencing exons of a human gene by MALDI-TOF MS. Compare 
to gel-based automated fluorescent DNA sequencing, the read lengths are shorter. Microchip technology can be in- 
corporated to provide for parallel processing. Sequencing products generated in the microtiter plate can be directly 
transferred to a microchip which serves as a launching pad for MALDI-TOF MS analysis. Robot-driven serial and 



83 



EP 1 164 203 A2 



parallel nanoliter dispensing tools are being used to produce 100-1000 element DNA arrays on < 1 M square chips with 
flat or geometrically altered (e.g. , with wells) surfaces for rapid mass spectrometric analysis. 

[0503] Figure 94 shows an MS spectrum obtained on a chip where the sample was transferred from a microtiter 
plate by a pintool. The estimated amount of each termination product loaded is 5 fmol or less which is in the range of 
amounts used in conventional Sanger sequencing with radiolabeled or fluorescent detection (0.5-1 fmol per fragment). 
The low volume MALDI sample deposition has the advantages of miniaturization (reduced reagent cts), enhanced 
reproducibility and automated signal acquisition. 

EXAMPLE 26 

Direct detection of synthetic and biologically generated double-stranded DNA by MALDI-TOF MS 
Introduction 

[0504] Typically, matrix-associated laser desorption/lonization (Karas, et. al. , ( 1 989) Int. J. Mass Spectrom, Ion Proc- 
essor 92, 231) time-of-flight mass spectrometry (MALDI-TOF MS) of DNA molecules which are double stranded (ds) 
in solution yields molecular ions representative of the two single stranded components (Tang, et at. (1994) Rapid 
Commun. Mass Spectrom. B:183;Tang t et al. (1995) Nucleic Adds Res. 23:3126; Benner, etal. (1995) Rapid Commun. 
Mass Spcctrom. 9:537; Liu, etal. (1 995) Anal. Chem. 67:3482; Siegert etal. (1996) Anal. Biochem. 243 :55; and Dok- 
tycz, etal (1 995) Anal. Biochem. 230 :205); this has been observed in several reports dealing with biologically generated 
DNA from a polymerase chain reaction (PCR) amplification (Tang, et al. (1994) Rapid Commun. Mass Spectrom. 8: 
183; Liu, et al. (1995) Anal. Chem. 67 :3482; Siegert et al. (1996) Anal. Biochem. 243:55; and Doktycz, et al. (1995) 
Anal. Biochem. 230:205). It is not clear whether the double strand is destabilized because of the decreased pH in the 
matrix environment or because of absorbance by the duplex during desorptlon/ionization/acceleration of an energy 
sufficient to overcome the attractive van der WaaJs and "stacking* stabilization forces (Cantor and Shlmmel, Biophysical 
Chemistry Part I: The conformation of Biomolecules , W.H. Freeman, New York, (1980), 176). When analyte is present 
at high concentrations formation of non-specific gas-phase DNA multimers is, as with proteins (Karas, et. al., (1989) 
Int. J. Mass Spectrom, Ion Processes 92:231). common; however, Lecchiand Pannefl (Lecchi etal. (1995) J. Am. Soc. 
Mass Spectrom. 6:972) have provided strong evidence for specific Watson Crick (WC) base pairing being maintained 
in the gas phase. They detected these specific dimers when using 6-aza-2-thiothymine as a matrix, but did not observe 
them with 3-hydroxypicclinic acid (3-HPA) or 2,4,6- hydroxyacetophenone matrix. As described below, by using a low 
acceleration voltage of the ions and preparing samples for MALDI analysis at reduced temperatures, routine detection 
of dsDNA is possible. 

MATERIALS AND METHODS 

[0505] Synthetic DNA. Oligonucleotides were synthesized (Sinha, et al. (1 984) Nucterc Acids Res., 1 2, 4539) on a 
Perspective Expedite DNA synthesizer and reverse phase HPLC purified in-house. Sequences were: 50-mer (15337 
Da): 5'-TTG CGT ACA CAC TGG CCG TCG TTT TAC AAC GTC GTG ACT GGG AAA ACC CT-3' (SEQ ID NO. 109); 
27-mer c (complementary, 8343 Da): 5'-GTA AAA CGA CGG CCA GTG TGT ACG CAA-3' (SEQ ID NO. 110); 27-mer^ 
(non-complementary, 8293 Da): 5'-TAC TGG AAG GCG ATC TCA GCA ATC AGC-3' (SEQ ID NO. 1 1 1 ). 1 0OuJvl stock 
solutions were diluted to 20, 10, 5, and 2.5 uM using 18MohnVcm H 2 0. 2uL each of equimolar solutions of the 50-mer 
and either 27-mer c or 27-mer nc were mixed and allowed to anneal at room temperature for 1 0 minutes. 0.5u.L of these 
mixtures were mixed directly on a sample target with 1 u,L matrix (0.7 M 3-HPA, 0.07 M ammonium citrate in 50% 
acetonitrile) and allowed to air dry. 

Biological DNA. Enzymatic digestion of human genomic DNA from leukocytes was performed. PCR primers (forward, 
5'-GGC ACG GCT GTC CAA GGA G-3' (SEQ ID NO. 112)); reverse, 5'-AGG CCG CGC TCG GCG CCC TC-3' (SEQ 
ID NO. 113) to amplify a portion of exon 4 of the apolipoprotein E gene were delineated from the published sequence 
(Das et al., (1 985) J. Biol. Chem., 260 6240). Taq polymerase and 1 0x buffer were purchased from Boeh ringer- Man- 
nheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 pJ including 20 
pmol of each primer and 10% DMSO (dimethyisulfoxide, Sigma) with approximately 200 ng of genomic DNA used as 
template. Solutions were heated to 80°C before the addition of IU polymerase; PCR conditions were: 2 min at 94°C, 
followed by 40 cycles of 30 sec at 94*C, 45 sec at 63°C, 30 sec at 72°C, and a final extension time of 2 min at 72°C. 
While no quantitative data was collected to determine the final yield of amplified product, it is estimated that -2pmol 
were available for the enzymatic digestion. 

[0506] Cfoi and RsaJ and reaction buffer L were purchased from Boehringer- Mannheim. 20u.l of amplified products 
were diluted with 1 5u.l water and 4uJ buffer L; after addition of 1 0 units of restriction enzymes the samples were incubated 
for 60 min at 37*C. For precipitation of digest products 5uJ of 3M ammonium acetate (pH 6.5), (5u.l glycogen (Braun, 
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et at. (1997) Clin. Chem. 43:1151) (lOmg/ml, Sigma), and 110^1 absolute ethanol were added to 50u,L of the analyte 
solutions and stored for 1 hour at room temperature. After at 1 0 mln centrifugation at 1 3,000 X g, the pellet was washed 
In 70% ethanol and resuspended in 1u.l 18Mohm/cm H 2 0. 

[0507] Sample preparation and analysts by MALDI-TOF MS. 0.35uJ of resuspended DNA was mixed with 0.35-1 .3 
u.L matrix solution {0.7M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H 2 0:CH 3 CN) (Wu, el al. 
(1993) Rapid Commun. Mass Spectrom. 7:142) on a stainless steel sample target disk and allowed to air dry preceding 
spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF instrument operated In pitlve ion reflectron 
mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (M r 
(cafe)) of the fragments were calculated from atomic compositions; the mass of a proton (1 .08 Da) was subtracted from 
raw data values in reporting experimental molecular masses (M^exp)) as neutral basis. External calibration generated 
from eight peaks (2000-18000 Da) was used for all spectra. 

Results and Discussion 

[0508] Figure 96A is a MALDI-TOF mass spectrum of a mixture of the synthetic 50-mer with (non-complementary) 
27-mer^ (each 10 u,M, the highest final concentration used in this study); the laser power was adjusted to just above 
the threshold irradiation for ionization. The peaks at 8.30 and 15.34 kDa represent singly charged ions derived from 
the 27- and 50-mer single strands, respectively. Poorly resolved low intensity signals at -16.6 and -30.7 kDa represent 
homodimers of 27-and 50-mer, respectively; that at 23.6 kDa is consistent with a heterodimer containing one 27-mer 
and one 50-mer strand. Thus low intensity dimer ions representing all possible combinations from the two non-com- 
plementary oligonucleotides (27+27; 27+50; 50+50) were observed. Increasing the irradiance even to a point where 
depurination peaks dominated the spectrum resulted in slightly higher intensities of these dimer peaks. Note that the 
hybridization was performed at room temperature and with a very low salt concentration, conditions at which non- 
specific hybridization may occur. 

[0509] Figure 96 shows a MALDI-TOF spectrum of the same 50-mer mixed with (complementary) 27-mer c ; the final 
concentration of each oligonucleotide was again 1 0u.M. Using the same laser power as in Figure 96A, intense signals 
were again observed at 88,34 and 15.34 Kda, consistent with single stranded 27- and 50-mer, respectively. Homodimer 
peaks (27 + 27; 50+ 50) were barely apparent in the noise; however, singly (23.68 Kda) and doubly (11 .B4k Da) charged 
heterodimer (27 + 50) peaks were dominant. Although the 23.68 Kda dimer peak could be detected from all irradiated 
positions, its intensity relative to the monomer peaks varied slightly from spot-to-spot. Repeating the experiment with 
individual oligonucleotide concentrations of 5, 2.5, and 1.25 u,M resulted in decreasing amounts of the 27-/50-mer 
Watson-Crick dimer peak relative to the 27- and 50-mer single stranded peaks. At the lowest concentrations, the ob- 
servation of dimer was ■crystal-dependent", that is, irradiation of some crystals produced significant 2 7-/50-mer dimer 
signal, while other crystals reproducibly yielded very little or none. This indicates that the incorporation of dsdna into 
the matrix crystals or the effectiveness of retaining this interaction through the ionization/desorption process is depend- 
ent upon the microscopic properties of the crystals, and/or that there exist steep concentration gradients of the duplex 
throughout the sample. 

[0510] Thus the Figure 96 spectra provide strong evidence that specific WC base paired dsdna can be observed 
using gentle laser conditions with high concentrations of oligonucleotides in this mass range, the first report of this 
using a 3-HPA matrix. The study was extended to a complex mixture of dsdna derived from an enzymatic digest (Rsal/ 
Cfol) of a region of exon 4 of the apolipoprotei n E gene (Das et. al. , (1 985) J. 8/67. Chem. t 260 6240); expected fragment 
masses are given In Table IX. 



Table IX 



|| Cfol/Rsal Digestion Products from ApoE gene exon 4 2 


| bases b 


SSDNA 


(Da) 


dsdna (Da) 


I (+) (-) 


(+) 


(-) 




II 11 13 


3428 


4025 


7453 


I 16 


5004 


4924 


9928 


18 


5412 


5750 


11162 


17 19 


5283 


5880 


11163 


| 19 [ 5999 


5781 


1,780 



f€3 allele has no 17/19 or 19/19 pairs; e4 allele contacts no 38/38 pair. 
(+) sense •trend, (-) antbtertse strand 
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Table IX (continued) 



Cfol/RsaJ Digestion Products from ApoE gene exon 4 2 


bases b 


ssDNA 


(Da) 


dsdna (Da) 


24 


22 


7510 


6745 


14225 


31 


29 


9628 


9185 


18813 


36 


38 


11279 


11627 


22906 j 


48 


14845 


14858 


29703 | 


56 


53 


17175 


16240 


33415 | 



&€3 allele has no 17/19 or 19/19 pairs; G4 aDete contains no 36/38 pair. 
b (+) sense strand. (•) anti sense strand 



After the digestion step, the samples were purified and concentrated by ethanol precipitation and resuspended in 1p.L 
H 2 0 before mixing them at room temperature with matrix on the sample target. Nearly 20 peaks ranging in mass from 
3.4-1 7.2 Kda were resolved in the products' MALDI spectrum (Figure 97A), all consistent with denatured single stranded 
components of the double strand (Table IX). Many such analyses of similar biological products over a period of months 
also yielded spectra with negligible dsdna, consistent with previous reports (Tang, eta/. (1994) Rapid Commun. Mass 
Spectrom. 8:183; Liu, etaL (1995) Anal. Chem. 67:3482; Siegert era/. (1996) Anal. Biochem. 243 :55; and Doktycz, et 
at. (1995) Anal. Biochem. 230:205); contrarily, intact double strands were observed under similar conditions for the 
synthetic DNA (Figure 96A). It is difficult to estimate the strand concentration available after the biological reactions, 
but presumably that ft was far lower than that at which dimerization of synthetic samples occurred. Furthermore, main- 
taining specific hybrids within the two -component synthetic mixture may be kineticafly favored relative to the far more 
complex mixture of 20 single-stranded DNA components from the digest. 

[051 1] The effect of reduced temperature on maintaining dsDNA was tested. An aliquot of the digested DNA solution, 
the matrix, pfpette, pipette tips, and the stainless steel sample target were stored in a 4°C "cold room" for 15 minutes; 
as with normal preparations matrix, and then analyte, were spotted on the target and allowed to co-crystallize while 
air drying. Crystallization for mixtures of 300 nL 3HPA (50% acetonitrrle) with 300 nl_ analyte required ~ 1 minute at 
room temperature but —15 minutes at the reduced temperature. Sample spots prepared in the cold room environment 
typically contained a high proportion of large transparent crystals. 

[0512] MALDI -TO F analysis of an ApoE digest aliquot prepared at reduced temperature produced the Figure 97B 
spectrum. While the low mass range appeared qualitatively similar to Figure 97A, dramatic differences above 8 kDa 
were observed. Only signals consistent with single strands (Table IX) were observed in Figure 97A, but the Figure 97B 
cold room prepared samples did not yield signals for the same masses except below 8 kDa. Even more striking were 
the additional high mass peaks In Figure 97B; clearly these represent dimer peaks containing lower mass components. 
As was done with the synthetic DNA. it was important to determine whether these represent non-specific heterodimers, 
specific WC heterodimers, or nonspecific homodimers. Consider first the 33.35 kDa fragment. Ignoring the unlikely 
possibility that the high mass fragment represents a trimer or higher multimer, as a dimer it must only contain the 
highest mass ss DNA components, Le., the >1 6 kDa. Homodimerization of the 1 5.24 and 1 7.1 8 kDa fragments would 
result In 32.49 and 34.35 kDa peaks, respectively; corresponding mass errors for these Incorrect assignments relative 
to the observed 33.35 kDa would be -2.6% and +3.0% respectively. A far better match is achieved if this peak originates 
from a heterodimer of the two highest mass single stranded fragments; their summed mass (1 6.24+1 7.1 8 = 33.42 kDa) 
differed by 0.2% from the observed dimer mass 33.35 kDa, an acceptable mass error for MALDI-TOF analysis of large 
DNA fragments using external calibration. Likewise, the 29 66kDa fragment was measured only 0. 1 3% lower than the 
29.70 Da expected for a heterodimer of 48-mers; the sum of no other posstole homodimers or heterodimers were within 
a reasonable range of this mass. Similar arguments could be made for the 22.89 and 1 8.83 kDa fragments, representing 
36-/38-mer and 31-/29-heterodimers, respectively; the signal at 14.86 kDa is consistent with singly charged single 
stranded and doubly charged double-stranded 48-mer. The agreement of the Figure 97B masses above 15 kDa with 
the of dsDNA expected from this digest and the absence of homodimers and non-specific heterodimers at random 
masses indicated that the base pairings were indeed highly specific and provided further evidence that gas-phase WC 
interactions may be retained in MALDI-generated ions. 

[0513] Figure 98 shows a MALDI-TOF spectrum of an G4 allele, which, unlike the G3, was expected to yield no 36-/ 
38-mer pair upon Cfol/Rsal digestion. The G3 and G4 mass spectra were similar except that abundant 22.89 kDa 
fragment in Figure 97B was not present in Figure 98; with this Information alone (Table IX) G3 and G4 alleles were 
easily distinguished, thereby demonstrating the genotyping by direct measurement of dsDNA by MALDI-TOF MS. 
Similarly dsDNA could be ionized, transferred to the gas phase, and detected by MALDI-TOF MS. The acceleration 
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voltage typically employed on our instrument was only - 5kV corresponding to 1 5kV/mm up to -2 mm from the sample 
target, with the electric field strength decreasing rapidly with distance from the sample target. Most previous work used 
at least 20kV acceleration (Lecchi etal. (1995) J. Am. Soc. Mass Spectrom. 6:972): in one exception a 27-mer dsDNA 
was detected using a frozen matrix solution and 1 00 V acceleration (Nelson, etal. (1 990) Rapid Commun. Mass Spec- 
trom. 4:348). Without being bound by any theory MALDI-induced "denaturation" of dsDNA may be due to gas-phase 
collisional activation that disrupts the WC pairing when high acceleration fields are employed, analogous to the dena- 
turation presumed to be a first step in the fragmentation used for sequencing the single stranded components of dsDNA 
using electrospray ionization (McLafferty et at. (1996) Int. J. Mass Spectrom., Ion Processes) . It appears that the high 
salt concentrations (typically > 10mM NaCI or KCI) required to stabilize WC paired dsDNA in solution are unsuitable 
for MALDI analysis (Nordhoff et a/. (1 993) Nucleic Acids Res. 21_:3347); reducing the concentration of such non-volatile 
cations Is necessary to avoid cation -adducted MALDI signals, but destabilizes the double strands in solution. The low 
pH conditions of the matrix environment should also destabilize the duplex. As shown in Figures 97B and 98, storing 
and preparing even low concentrations of the biological samples at reduced temperature at least in part offset these 
denaturing effects, especially for longer strands where melting temperatures are higher due to a more extensive hy- 
drogen bonding network. The conditions used here are recognized to be very non-stringent annealing conditions. 
[0514] The low mass tails on high mass dsDNA peaks ( e.g. , Figure 97B, 232 kDa) are consistent with depurinatlon 
generated to a higher extent than the sum of depurinatlon from each of the single strands combined. Although depu- 
rination in solution is an acid-catalyzed reaction, the weakly acidic conditions In the 3-HPA matrix do not induce sig- 
nificant depurinatlon; molecular ion signals from a mixed-base 50-mer measured with De-MALDI-TOF had only minor 
contributions from depurinatlon peaks (Juhaz, etal. (1 996) Anal. Chem. 68:941 ). Depurination from the single stranded 
components of the gas-phase dsDNA is observed even though these bases are expected to be hydrogen bonded to 
the complementary base of the accompanying strand, implying that covalent bonds are being broken before the strand 
is denatured. 

EXAMPLE 27 

Efficiency and Specificity Assay for Base-Specific Ribonucteases 

[0515] Aliquots sampled at regular time intervals during digestion of selected synthetic 20 to 25 mers were analyzed 
by mass spectrometry. Three of the RNAses were found to be efficient and specific. These include: the G-speciftc T 1f 
the A-specific U 2 and the A/U-specific PhyM. The ribonucleases presumed to be C-specific were found to be less 
reliable, e^., did not cleave at every C or also cleaved at U in an unpredictable manner. The three promising RNAses 
all yielded cleavage at all of the predicted positions and a complete sequence coverage was obtained. In addition, the 
presence of cleavage products containing one or several uncieaved positions (short incubation times), allowed align- 
ment of the cleavage products. An example of the MALDI-spectrum of an aliquot sampled after T, digest of a synthetic 
20-mer [SEQ ID NO:114J RNA is shown in Rgure 100, 

EXAMPLE 28 

Immobilization of amplified ONA targets to silicon wafers 
Silicon surface preparation 

[051 6] Silicon wafers were washed with ethanol, flamed over bunsen burner, and immersed in an anhydrous solution 
of 25% (by volume) 3-aminopropyltrlethoxysilane in toluene for 3 hours. The silane solution was then removed, and 
the wafers were washed three times with toluene and three times with dimethyl sulfoxide (DMSO). The wafers were 
then incubated in a 1 0mM anhydrous solution of N-succinimrdyl (4-iodoacetyl) aminobenzoate (SIAB) (Pierce Chemical, 
Rockford, IL) in anhydrous DMSO. Following the reaction, the SIAB solution was removed, and the wafers were washed 
three times with DMSO. In all cases, the iodoacetamido-functlonalized wafers were used immediately to minimize 
hydrolysis of the labile iodoacetamido-functlonaJfty. Additionally, all further wafer manipulations were performed In the 
dark since the iodoacetamlde-functlonality is light sensitive. 

Immobilization of amplified thioi-containlng nucleic acids 

[0517] The SIAB-conjugated silicon wafers were used to analyze specific free thiof-contalning DNA fragments of a 
particular amplified DNA target sequence. A 23-mer oligodeoxy nucleotide containing a 5'-dlsulfide linkage [purchased 
from Operon Technologies; SEQ ID NO: 1 1 7] that is complementary to the ^-region of a 1 1 2 bp human genomic DNA 
template [Qenebank Acc. No.: Z52259; SEQ ID NO: 118] was used as a primer in conjunction with a commercially 
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available 49-mer primer, which is complementary to a portion of the 5'-end of the genomic ONA [purchased from Operon 
Technologies; SEQ ID NO: 11 9], in PCR reactions to amplify a 135 bp DNA product containing a S'-disulfide linkage 
attached to only one strand of the DNA duplex [SEQ ID NO: 120]. 

[051 BJ The PCR amplification reactions were performed using the Amplitaq GoldKit [Perkin Elmer Catalog No. 
N808-0249]. Briefly, 200 ng 112 bp human genomic DNA template was Incubated with 10 \iM of 23-mer primer and 
Bu.M of commercially available 49-mer primer, 10 mM dNTPs, 1 unit of Amplitaq Gold DNA polymerase in the buffer 
provided by the manufacturer and PCR was performed in a thermocycler. 

[0519] The 5'-disulflde bond of the resulting amplified product was fully reduced using 10 mM tris-<2-carboxyethyl) 
phosphine (TCEP) (Pierce Chemical, Rockford, IL) to generate a free SMhiol group. Disulfide reduction of the modified 
oligonucleotide was monitored by observing a shift in retention time on reverse-phase FPLC. It was determined that 
after five hours in the presence of 10 mM TCEP, the disulfide was fully reduced to a free thiol. Immediately following 
disulfide cleavage, the modified oligonucleotide was incubated with the lodacetamido-functionalized wafers and con- 
jugated to the surface of the silicon wafer through the SIAB linker To ensure complete thiol deprotonation, the coupling 
reaction was performed at pH 8.0. Using 1 0mM TCEP to cleave the disulfide and the other reaction conditions described 
above, it was possible to reproducibly yield a surface density of 250 fmol per square mm of surface. 

Hybridization and MALDI-TOF Mass spectrometry 

[0520] The silicon wafer conjugated with the 135 bp thiol-containing DNA was incubated with a complementary 
12-mer oligonucleotide [SEQ ID NO: 121] and specifically hybridized DNA fragments were detected using MALDI-TOF 
MS analysis. The mass spectrum revealed a signal with an observed experimental mass-to-charge ratio of 3618.33; 
the theoretical mass-to-charge ratio of the 12-mer oligomer sequence is 3622.4 Da. 

[0521] Thus, specific DNA target molecule that contain a 5-disutfide linkage can be amplified. The molecules are 
immobilized at a high density on a SIAB-derivatlzed silicon wafer using the methods described herein and specific 
complementary oligonucleotides may be hybridized to these target molecules and detected using MALDJ-TOF MS 
analysis. 

EXAMPLE 29 

Use of High Density Nucleic Acid Immobilization to Generate Nucleic Acid Arrays 

[0522] Employing the high density attachment procedure described in EXAMPLE 28, an array of DNA oligomers 
amenable to MALDI-TOF mass spectrometry analysis was created on a silicon wafer having a plurality of locations, e. 

depressions or patches, on its surface. To generate the array, a free thlol-containing oligonucleotide primer was 
immobilized only at the selected locations of the wafer [e.g., see EXAMPLE 2B]. The each location of the array contained 
one of three different oligomers. To demonstrate that the different immobilized oligomers could be separately detected 
and distinguished, three distinct oligonucleotides of differing lengths that are complementary to one of the three oli- 
gomers were hybridized to the array on the wafer and analyzed by MALDI-TOF mass spectrometry. 

O II godeoxy nucleotides 

[0523] Three sets of complementary oligod eoxy nucleotide pairs were synthesized in which one member of the com- 
plementary oligonucleotide pair contains a 3'- or 5'-disulfide linkage [purchased from Operon Technologies or Ofigos, 
Etc.]. For example, Oligomer 1 [d(CTGATGCGTCGG ATCATCTTTTTT-SS) ; SEQ ID NO: 122J contains a 3'-disuffide 
linkage whereas Oligomer 2 [d(SS-CCTCTTGGGAACTGTGTAGTATT); a 5'-disuffide derivative of SEQ ID NO: 117] 
and Oligomer 3 [d(SS-GAATTCGAGCTCGGTACCCGG); a S'-dlsulfide derivative of SEQ ID NO: 115] each contain a 
S'-disulfide linkage. 

[0524] The oligonucleotides complementary to Oligomers 1 -3 were designed to be of different lengths that are easily 
resolvable from one another during MALDI-TOF MS analysis. For example, a 23-mer oligonucleotide [SEQ ID NO: 
123] was synthesized complementary to a portion of Oligomer 1, a 12-mer oligonucleotide [SEQ ID NO: 121] was 
synthesized complementary to a port ion of Oligomer 2 and a 21 -mer [SEQ ID NO: 116] was synthesized complementary 
to a portion of Oligomer 3. In addition, a fourth 29- mer oligonucleotide [SEQ ID NO: 124] was synthesized that lacks 
complementarity to any of the three oligomers. This fourth oligonucleotide was used as a negative control. 
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Silicon surface chemistry and DMA immobilization 
(a) 4 x 4 (16-locatlon) array 

[0525] A 2 X 2 cm 2 silicon wafer having 256 individual depressions or wells in the form of a 1 6 X 1 6 well array was 
purchased from a commercial supplier [Accelerator Technology Corp., College Station, Texas]. The wells were 800 X 
800 ujn 2 , 120 urn deep, on a 1 .125 pitch. The silicon wafer was reacted with 3-amlnopropyltrfethoxysifane to produce 
a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SIAB resulting 
in iodoacetamido functionalities on the surface [e.g. , see EXAMPLE 28]. 

[0526] To prepare the oligomers for coupling to the various locations of the silicon array, the disulfide bond of each 
oligomer was fully reduced using 10 mM TCEP as depicted in EXAMPLE 28, and the DNA resuspended at a final 
concentration of 1 OjiM In a solution of 1 00 mM phosphate buffer, pH 8.0. Immediately following disulfide bond reduction, 
the free-thiol group of the oligomer was coupied to the iodoacetamido functionality at 16 locations on the wafer using 
the probe coupling conditions essentially as described above in EXAMPLE 28. To accomplish the separate coupling 
at 16 distinct locations of the wafer, the entire surface of the wafer was not flushed with an oligonucleotide solution 
but, instead, an ~30-n» aliquot of a predetermined modified oligomer was added in parallel to each of 16 locations (I. 
e., depressions) of the 256 wells on the wafer to create a 4 x 4 array of immobilized DNA using a robotic pintool. 
[0527] The robotic pintool consists of 16 probes housed in a probe bJock and mounted on an X Y, Z robotic stage. 
The robotic stage was a gantry system which enables the placement of sample trays below the arms of the robot. The 
gantry unit itself is composed of X and Y arms which move 250 and 400 mm, respectively, guided by brush less linear 
servo motors with positional feedback provided by linear optical encoders. A lead screw driven Z axis (50 mm vertical 
travel) is mounted to the xy axis slide of the gantry unit and Is controlled by an in-line rotary servo motor with positional 
feedback by a motor-mounted rotary optical encoder. The work area of the system is equipped with a slide-out tooling 
plate that holds five microliter plates (most often, 2 plates of wash solution and 3 plates of sample for a maximum of 
1152 different oligonucleotide solutions) and up to ten 20x20 mm wafers. The wafers are placed precisely in the plate 
against two banking pins and held secure by vacuum. The entire system fs enclosed in plexiglass housing for safety 
and mounted onto a steel support frame for thermal and vibrational damping. Motion control is accomplished by em- 
ploying a commercial motion controller which was a 3-axis servo controller and is integrated to a computer; program- 
ming code for specific applications is written as needed. 

[0528] To create the DNA array, a pintool with assemblies that have solid pin elements was dipped into 16 wells of 
a mutti-well DNA source plate containing solutions of Oligomers 1-3 to wet the distal ends of the pins, the robotic 
assembly moves the pin assembly to the silicon wafer, and the sample spotted by surface contact. Thus, one of modified 
Oligomers 1-3 was covalentty immobilized to each of 16 separate wells of the 256 wells on the silicon wafer thereby 
creating a 4 x 4 array of immobilized DNA. 

[0529] In carrying out the hybridization reaction, the three complementary oligonucleotides and the negative control 
oligonucleotide were mixed at a final concentration of 1 0 uM for each oligonucleotide in 1 ml of TE buffer [1 0 mM Tris- 
HCI, pH 8.0, 1 mM EDTA] supplemented with 1 M NaCI, and the solution was heated at 65°C for 1 0 min. Immediately 
thereafter, the entire surface of the silicon wafer was flushed with 800 u.l of the heated oligonucleotide solution. The 
complementary oligonucleotides were annealed to the immobilized oligomers by incubating the silicon array at ambient 
temperature for 1 hr, followed by incubation at 4°C for at least 10 min. Alternatively, the oligonucleotide solution can 
be added to the wafer which is then heated and allowed to cool for hybridization. 

[0530] The hybridized array was then washed with a solution of 50 mM ammonium citrate buffer for cation exchange 
to remove sodium and potassium ions on the DNA backbone (Pieles et a/., (1 993) Nucl. Acids Res. 21^:31 91 -31 96). A 
6-nl aliquot of a matrix solution of 3-hydroxypicolinicacld [0.7 M 3-hydroxypicolinic acld-10 % ammonium citrate in 50 
% acetonitrile; see Wu era/. Rapid Commun. Mass Spectrom. 7:142-146 (1 993)] was added in series to each location 
of the array using a robotic piezoelectric serial dispenser (Le., a piezoelectric pipette system). 
[0531] The piezoelectric pipette system is built on a system purchased from Microdrop GmbH, Norderstedt Germany 
and contains a piezoelectric element driver which sends a pulsed signal to a piezoelectric element bonded to and 
surrounding a glass capillary which holds the solution to be dispensed; a pressure transducer to load (by negative 
pressure) or empty (by positive pressure) the capillary; a robotic xyz stage and robot driver to maneuver the capillary 
for loading, unloading, dispensing, and cleaning, a stroboscope and driver pulsed at the frequency of the plezo element 
to enable viewing of 'suspended' droplet characteristics; separate stages for source and designation plates or sample 
targets (i.e. Si chip); a camera mounted to the robotic arm to view loading to designation plate; and a data station which 
controls the pressure unit, xyz robot, and piezoelectric driver. 

[0532] The 3-HPA solution was allowed to dry at ambient temperature and thereafter a 6-nl aliquot of water was 
added to each location using the piezoelectric pipette to resuspend the dried matrix-DNA complex, such that upon 
drying at ambient temperature the matrix-DNA complex forms a uniform crystalline surface on the bottom surface of 
each location. 
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MALDI-TOF MS analysis 

[0533] The MALDI-TOF MS analysis was performed In series on each of the 16 locations of the hybridization array 
illustrated In Figure 6 essentially as described in EXAMPLE 28. The resulting mass spectrum of oligonucleotides that 
specifically hybridized to each of the 16 locations of the DNA hybridization revealed a specific signal at each location 
representative of observed experimental mass-to-charge ratio corresponding to the specific complementary nucleotide 
sequence. 

[0534] For example, in the locations that have only Oligomer 1 conjugated thereto, the mass spectrum revealed a 
predominate signal with an observed experimental mass-to-charge ratio of 7072.4 approximately equal to that of the 
23-men the theoretical mass-to-charge ratio of the 23-mer is 7072.6 Da. Similarly, specific hybridization of the 12-mer 
oligonucleotide to the array, observed experimental mass-to-charge ratio of 3618.33 Da (theoretical 3622.4 Da), was 
detected only at those locations conjugated with Oligomer 2 whereas specific hybridization of MJM6 (observed exper- 
imental mass-to-charge ratio of 6415.4) was detected only at those locations of the array conjugated with Oligomer 3 
[theoretical 6407.2 Da]. 

[0535] None of the locations of the array revealed a signal that corresponds to the negative control 29-mer oligonu- 
cleotide (theoretical mass-to-charge ratio of 8974.8) indicating that specific target DNA molecules can be hybridized 
to oligomers covatently immobilized to specific locations on the surface of the silicon array and a plurality of hybridization 
assays may be individually monitored using MALDI-TOF MS analysis. 

(b) 8 x 8 (64-location) array 

[0536] A 2 X 2 cm 2 silicon wafer having 256 individual depressions or wells that form a 1 6 X 1 6 array of wells was 
purchased from a commercial supplier [Accelerator Technology Corp., College Station, Texas). The wells were 800 X 
800 nm 2 , 120 um deep, on a 1 .125 pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysirane to produce 
a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SIAB resulting 
in iodoacetamido functionalities on the surface as described above. 

[0537] To make an array of 64 elements, a plntool was used following the procedures described above. The pintool 
was dipped into 1 6 wells of a 384 well DNA source plate containing solutions of Oligomers 1-3, moved to the silicon 
wafer, and the sample spotted by surface contact. Next, the tool was dipped in washing solution, then dipped into the 
same 1 6 wells of the source plate, and spotted onto the target 2.25mm offset from the initial set of 16 spots; the entire 
cycle was repeated to make a 2x2 array from each pin to produce an 8x8 array of spots (2x2 elements/pin X 16 pins 
= 64 total elements spotted). 

[0538] Oligomers 1 -3 immobilized to the 64 locations were hybridized to complementary oligonucleotides and ana- 
lyzed by MALDI-TOF MS analysis. As observed for the 16-location array, specific hybridization of the complementary 
oligonucleotide to each of the immobilized thlol-containing oligomers was observed in each of the locations of the DNA 
array. 

EXAMPLE 30 

Extension of hybridized DNA primers bound to ONA templates immobilized on a silicon wafer 

[0539] The SIAB-derivatlzed silicon wafers can also be employed for primer extension reactions of the immobilized 
DNA template using the procedures essentially described In EXAMPLE 7. 

[0540] A 27-trier oligonucleotide [SEQ ID NO: 1 25] containing a 3'-free thiol group was coupled to a SlAB-derivatized 
silicon wafer as described above, for example, in EXAMPLE 28. A 12-mer oligonucleotide primer [SEQ iD NO: 1261 
was hybridized to the immobilized oligonucleotide and the primer was extended using a commercially avaiiable kit [e. 
g., Sequenase or ThermoSequenase, U.S. Biochemical Corp]. The addition of Sequenase DNA polymerase or Ther- 
moSequenase DNA polymerase in the presence of three deoxyribo nucleoside triphosphates (dNTPs; dATP, dGTP, 
dCTP) and dldeoxyribonucleoside thymidine triphosphate (ddTTP) in buffer according to the instructions provided by 
the manufacturer resulted in a 3-base extension of the 12-mer primer while still bound to the silicon wafer. The wafer 
was then analyzed by MALDI-TOF mass spectrometry as described above. The mass spectrum results clearly distin- 
guish the 15-mer [SEQ ID NO: 127] from the original unextended 12-mer thus indicating that specific extension can 
be performed on the surface of a silicon wafer and detected using MALDI-TOF MS analysis. 
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EXAMPLE 31 

Effect of linker length on polymerase extension of hybridized DNA primers bound to DNA templates 
immobilized on a silicon wafer 

[0541] The effect of the distance between the SIAB-conjugated silicon surface and the duplex DNA formed by hy- 
bridization of the target DNA to the immobilized oligomer template was investigated, as well as choice of enzyme. 
[0542] Two SIAB-derivatized silicon wafers were conjugated to the 3'-end of two free thiol-containing oligonucleotides 
of identical DNA sequence except for a 3-base poly dT spacer sequence incorporated at the 3*-end: 

CTGATGCGTC GGATCATCTT TTTT SEQ ID No^ 122 

CTGATGCGTC GGATCATCTT TTTTTTT SEQ ID No. 125. 

These oligonucleotides were synthesized and each was separately immobilized to the surface of a silicon wafer through 
the SiAB cross-linker [ e.g. , see EXAMPLE 28]. Each wafer was incubated with a 12-mer oligonucleotide: 

AAAAAAGATG AT SEQ ID No. 126 

GATG ATCCG A CG SEQ ID No. 1 28 

GATCCGACGC AT SEQ ID No. 129, 

which is complementary to portions of the nucleotide sequences common to both of the oligonucleotides, by denaturing 
at 75 °C and slow cooling the silicon wafer. The wafers were then analyzed by MALDI-TOF mass spectrometry as 
described above. 

[0543] As described in EXAMPLE 30 above, a 3-base specific extension of the bound 12-mer oligonucleotide was 
observed using the oligomer primer where there is a 9-base spacer between the duplex and the surface (SEQ ID NO: 
125]. Similar results were observed when the DNA spacer lengths between the SIAB moiety and the DNA duplex were 
0, 3, 6 and 12. In addition, the extension reaction may be performed using a variety of DMA polymerases, such as 
Sequenase and TTiermo Sequenase (US Biochemical). Thus, the SIAB linker may be directly coupled to the DNA 
template or may include a linker sequence without effecting primer extension of the hybridized DNA. 

EXAMPLE 32 

Spect rochlp mutant detection In ApoE gene 

[0544] This example describes the hybridization of an immobilized template, primer extension and mass spectrometry 
for detection of the wildtype and mutant Apolipoprotein E gene for diagnostic purposes. This example demonstrates 
that immobilized DNA molecules containing a specific sequence can be detected and distinguished using primer ex- 
tension of unlabeled allele specific primers and analysis of the extension products using mass spectrometry. 
[0545] A 50 base synthetic DNA template complementary to the coding sequence of allele 3 of the wildtype apoli- 
poprotein E gene: 

5'- GCCTGGTACACTGCCAGGCGCTTCTGCAGGTCATCGGCATCGCGGAGGAG -3' 
(SEQ ID NO: 280] 

or complement to the mutant apolipoprotein E gene carrying a G ->A transition at codon 158; 

5'-GCCTGGTACACTGCCAGGCACTTCTGCAGGTCATCGGCATCGCGGAGGAG-3' 
(SEQ ID NO: 281] 
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containing a 3'-free thiol group was coupled to separate SIAB-derivatized silicon wafers as described in Example 28. 
[0546] A 21-mer oligonucleotide primer: 

5'-GAT GCC GAT GAC CTG CAG AAG-3' [SEQ ID NO: 282J was hybridized to each of the immobilized templates and 
the primer was extended using a commercially available kit [e.g., Sequenase or Thermosequenase, U.S. Biochemical 
Corp]. The addition of Sequenase DNA polymerase or Thermosequenase DNA polymerase in the presence of three 
deoxyribonucleoslde triphosphates (dNTPs; dATP, dGTP, dTTP) and dideoxyribonucieoside cytosine triphosphate (dd- 
CTP) In buffer according to the instructions provided by the manufacturer resulted In a single base extension of the 
21-mer primer bound to the Immobilized template encoding the wildtype apolipoproteln E gene and a three base ex- 
tension of the 21-mer primer bound to the immobilized template encoding the mutant form of apo lipoprotein E gene. 
[0547] The wafers were analyzed by mass spectrometry as described herein. The wildtype apollpoprotein E se- 
quence results in a mass spectrum that distinguishes the primer with a single base extension (22-mer) with a mass to 
charge ratio of 6771 .1 7 Da (the theoretical mass to charge ratio is 6753.5 Da) from the original 21 -mer primer with a 
mass to charge ration of 6499.64 Da. The mutant apolipoproteln E sequence results in a mass spectrum that distin- 
guishes the primer with a three base extension (24-mer) with a mass to charge ratio of 7386.9 (the theoretical mass 
charge is 7386.9) from the original 21-mer primer with a mass to charge ration of 6499.64 Da. 

EXAMPLE 33 

Detection of Double-Stranded Nucleic Acid Molecules via Strand Displacement and Hybridization to an 
Immobilized Complementary Nucleic Acid 

[0548] This example describes immobilization of a 24-mer primer and the specific hybridization of one strand of a 
duplex DNA molecule, thereby permitting amplication of a selected target molecule in solution phase and permitting 
detection of the double stranded molecule. This method is useful for detecting single base changes, and, particularly 
for screening genomic libraries of double-stranded fragments. 

[0549] A 24-mer DNA primer CTGATGCGTC GGATCATCTT TTTT SEQ ID No. 1 22, containing a 3'-free thiol group 
was coupled to a SIAB -derivatized silicon wafer as described in Example 29. 
[0550] An 1 8-mer synthetic oligonucleotide: 

5'-CTGATGCGTCGGATCATC-3' [SEQ ID NO: 286] was premixed with a 1 2-mer 5'-G ATG ATCCGACG-3' [SEQ ID NO: 
285] that has a sequence that is complementary to 12 base portion of the 1 8-mer oligonucleotide. The oligonucleotide 
mix was heated to 75°C and cooled slowly to room temperature to faciliate the formation of a duplex molecule: 

5 * -CTGATGCGTCGGATCATC-3 ' [SEQ ID NO. 286] 
3' - GCAGCCTAGTAG- 5 ' [SEQ ID NO: 287]. 

[0551] The specific hybridization of the 1 2-mer strand of the duplex molecule to the immobilized 24-mer primer was 
carried out by mixing 1 u.M of the duplex molecule using the hybridization conditions described in Example 30. 
[0552] The wafers were analyzed by mass spectrometry as described above. Specific hybridization was detected in 
a mass spectrum of the 1 2-mer with a mass to charge ratio of 3682.78 Da. 

EXAMPLE 34 

1 -<2-Nttro-5-(3-0^,4'Klimethoxy^^ 
ethane 

A. 2-Nitro-5-(3-hydroxypropoxy)benzardehyde 

[0553] 3-Bromo-1 -propanol (3.34 g, 24 mmol) was refluxed in 80 ml of anhydrous acetonitrile with 5-hydroxy-2-ni- 
trobenzaldehyde (3.34 g, 20 mmol), ^003 (3.5 g). and Kl (100 mg) overnight (15 h). The reaction mixture was cooled 
to room temperature and 150 ml of methylene chloride was added. The mixture was filtered and the solid residue was 
washed with methylene chloride. The combined organic solution was evaporated to dryness and redissolved in 100 
ml methylene chloride. The resulted solution was washed with saturated NaCI solution and dried over sodium sulfate. 
4.31 g (96%) of desired product was obtained after removal of the solvent in vacuo. 
Ri = 0.33 (dlchloromethane/imethanol, 95/5). 

UV (methanol) maximum: 313, 240 (shoulder), 215 nm; minimum: 266 nm. 

'H NMR (DMSO-de) 8 10.28 <s, 1H), 8.17 (d. 1H), 7.35 (d, 1H), 7.22 (s, 1H), 4.22(t, 2H). 3.54 (t, 2H), 1.90 (m, 2H). 
13C NMR (DMSO-de) 5 189.9, 153.0. 141.6, 134.3, 127.3, 118.4, 114.0, 66.2, 56.9, 31.7. 
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B. 2-Nltro-5-(3-0-t-butyldlmethylsllylpropoxy)ben2a[dehyde 

[0554] 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde(1 g, 4.44 mmol) was dissolved in 50 ml anhydrous acotonitrile. 
To this solution, it was added 1 ml of triethylamine, 200 mg of imidazole, and 0.8 g (5.3 mmol) of tBDMSCi. The mixture 
was stirred at room temperature for 4 h. Methanol (1ml) was added to stop the reaction. The solvent was removed in 
vacuo and the solid residue was redissolved in 100 ml methylene chloride. The resulted solution was washed with 
saturated sodium bicarbonate solution and then water. The organic phase was dried over sodium sulfate and the 
solvent was removed in vacuo. The crude mixture was subjected to a quick silica gel column with methylene chloride 
to yield 1 .44 g (96%) of 2-nitro-5-(3«0-l-butyldimethylsilylpropoxy)benzaldehyde. 
R f = 0.67 (hexane/ethyi acetate , 5/1 ) . 

UV (methanol), maximum: 317, 243, 215 nm; minimum:.235, 267 nm. 1 H NMR (DMSO-d 6 ) 8 10.28 (s, 1H), 8^14 (d, 
1H), 7.32 (d, 1H), 7.20 (s, 1 H), 4.20 (t, 2H), 3.75 (t, 2H), 1 .90 (m. 2H), 0.85 (s, 9H), 0.02 (s, 6H). ™C NMR (DMSO-oy 
5 189.6, 162.7, 141.5, 134.0, 127.1, 118.2, 113.8,65.4, 58.5,31.2,25.5,-3.1, -5.7. 

C. 1-(2-Nitro-5-(3-0-t-*>utyldlmethylsilylpropoxy)phenyl)ethanol 

[0555] High vacuum dried 2-nitro-5-(3-0-t-butyldimethylsilylpropoxy)benzaldehyde (1 .02 g, 3 mmol) was dissolved 
50 ml of anhydrous methylene chloride. 2 M Trimethylaluminium In toluene (3 ml) was added dropwise within 10 min 
and keeped the reaction mixture at room temperature. It was stirred further for 1 0 min and the mixture was poured Into 
10 ml ice cooled water. The emulsion was separated from water phase and dried over 100 g of sodium sulfate to 
remove the remaining water. The solvent was removed in vacuo and the mixture was applied to a silica gel column 
with gradient methanol in methylene chloride. 0.94 g (86%) of desired product was isolated. 
R f = 0.375 (hexane/ethyl acetate, 5/1). 

UV (methanol), maximum: 306, 233, 206 nm; minimum: 255, 220 nm. 1 H NMR (DMSO-de) 5 8.00 (d, 1H), 7.36 (s, 1H), 
7.00 (d, 1 H), 5.49 (b, OH), 5.31 (q, 1 H), 4.19 (m, 2H), 3.77 (t, 2H), 1 .95 (m, 2H), 1 .37 (d, 3H), 0.86 (s, 9H), 0.04 (s, 6H). 
13C NMR (DMSO-de) 8 162.6, 146.2, 139.6, 126.9, 112.9, 112.5, 64.8, 63.9, 58.7, 31.5, 25.6, 24.9, -3.4, -5.8. 

D. 1-(2-Nltro-5-<3-hydroxypropoxy)phenyl)ethanol 

[0556] 1 -(2-Nitro-5-(3-Ot-butyldimethylsilylpropoxy)phenyl)ethanol (0.89 g, 2.5 mmol) was dissolved in 30 ml of THF 
and 0.5 mmol of nBu^NF was added under stirring. The mixture was stirred at room temperature for 5 h and the solvent 
was removed in vacuo. The remaining residue was applied to a silica gel column with gradient methanol In methylene 
chloride. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.6 g (99%) was obtained. 
R,=0.17 (dichloromethane/methanol, 95/5). 

UV (methanol), maximum: 304, 232, 210 nm; minimum: 255, 219 nm. t H NMR (DMSO-d 6 ) 8 8.00 (d, 1 H), 7.33 (s, 1H), 
7.00 (d, 1H), 5.50 (d, OH), 5.28 (t, OH), 4.59 (t, 1H), 4.17 (t, 2H), 3.57 (m, 2H), 1.89 (m, 2H), 1.36 (d, 2H). 
13 C NMR (DMOS-d 6 )8162.8, 146.3. 139.7, 127.1. 113.1, 112.6. 65.5, 64.0, 57.0,31.8,25.0. 

E. 1-(2-Nltro-5-<3-0-4,4'-dimethoxytrrtyIpropoxy)phenyl)ethanol 

[0557] 1 -(2-Nltro-5-(3«hydroxypropoxy)phenyl)ethanol (0.482 g, 2 mmol) was co-evaporated with anhydrous pyridine 
twice and dissolved in 20 m! anhydrous pyridine. The solution was cooled In ice-water bath and 750 mg (2.2 mmol) of 
DMTCI was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml methanol was added 
to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with toluene twice to 
remove trace of pyridine. The final residue was applied to a silica gel column with gradient methanol In methylene 
chloride containing drops of triethylamine to yield 0.96 g (89%) of the desired product 1 -(2-nitro-5-(3-0-4,4'-dimethox- 
ytrityl-propoxy)phenyl)ethanol. 
R,=0.50 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 350 (shoulder), 306, 283, 276 (shoulder), 233, 208 nm; minimum: 290, 258, 220 nm. 

1 H NMR (DMSO-d 6 ) 8 8.00 (d, 1H), 6.82-7.42 (ArH), 5.52 (d, OH), 5.32 (m, 1H), 4.23 (t, 2H), 3.71 (s, 6H), 3.17 (t. 2H). 

2.00 (m, 2H), 1 .37 <d, 3H). 

13C NMR (DMOS-de) 8 162.5, 157.9, 157.7, 146.1, 144.9, 140.1, 139.7, 135.7, 129.5, 128.8, 127.6, 127.5, 127.3, 
126 9, 126.4, 113.0, 112.8, 112.6, 85.2, 65.3, 63.9, 59.0, 54.8, 28.9. 24.9. 

F. 1-(2-Nltro-5-{3-0-4,4'-dimethoxytrity!propoxy 
ethane 

[0558] 1 -(2-Nitro-5-(3-0-4,4 , -dimethoxytritytpropoxy)phenyl)ethanol (400 mg, 0.74 mmol) was dried under high vac- 
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uum and was dissolved in 20 ml of anhydrous methylene chloride. To this solution, it was added 0.5 ml N,N-dilsopro- 
pylethylamine and 0.3 ml (1 .34 mmol) of 2-cyanoethyl-N,N-diisopropylchlorophosphoramidite. The reaction mixture 
was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture was 
washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed In 
vacuo and a quick silica gel column with t % methanol in methylene chloride containing drops of trlethylamine yield 
510 mg (93%) the desired phosphoramidite. 
RpO.87 (dichloromethane/methanol, 99/1). 



EXAMPLE 35 

H4^3-CM,4^Dimethoxytritylpro 
dlfsop ropy lam lnophosphlno)ethane 



A. 4-(3-Hydroxypropoxy)-3-methoxyacetophenone 

[0559] 3-Bromo-1 -propanol (53 ml, 33 mmol) was ref luxed in 1 00 ml of anhydrous acetonitrile with 4-hydroxy-3-meth- 
oxyacetophenone (5 g, 30 mmol), K^COq (5 g), and Kl (300 mg) overnight (15 h). 

Methylenechloride (150 ml) was added to the reaction mixture after cooling to room temperature. The mixture was 
filtered and the solid residue was washed with methylene chloride. The combined organic solution was evaporated to 
dryness and redissolved In 1 00 ml methylene chloride. The resulted solution was washed with saturated NaCI solution 
and dried over sodium sulfate. 6.5 g (96.4%) of desired product was obtained after removal of the solvent in vacuo. 
Rf=0.41 (dichloromethane/methanol, 95/5). 

UV (methanol), maximum: 304, 273, 227, 210 nm: minimum: 291, 244, 214 nm. 

1 H NMR (DMSO-dg) 5 7.64 (d, 1 H), 7.46 (s, 1 H), 7.04 (d, 1 H). 4.58 (b, OH), 4.12 (t, 2H), 3.80 (s, 3H), 3.56 ft 2H), 2.54 
(s, 3H), 1.88 (m, 2H). «C NMR (DMSO-dg) 6 196.3, 152.5, 148.6, 129.7, 123.1, 111.5, 110.3, 65.4, 57.2, 55.5, 31.9, 
26.3. 



B. 4-(3 -Acetoxy pro poxy )-3-methoxyacetop he none 

[0560] 4-(3-Hydroxypropoxy)-3-methoxyacetophenone (3.5 g, 15.6 mmol) was dried and dissorved in 80 ml anhy- 
drous acetonitrile. This mixture, 6 ml of trlethylamine and 6 ml of acetic anhydride were added. After 4 h, 6 ml methanol 
was added and the solvent was removed in vacuo. The residue was dissorved in 1 00 ml dichloromethane and the 
solution was washed with dilute sodium bicarbonate solution, then water. The organic phase was dried over sodium 
sulfate and the solvent was removed. The solid residue was applied to a silica gel column with methylene chloride to 
yield 4.1g of 4-(3-acetoxypropoxy)-3-methoxyacetophenone (98.6%). 
R, = 0.22 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 303, 273, 227, 210 nm; minimum: 290, 243, 214 nm. 

1H NMR (DMSO-de) 5 7.62 (d, 1H), 7.45 (s, 1H), 7.08 (d, 1H), 4.12 (m, 4H, 3.82 (s, 3H), 2.54 (s, 3H), 2.04 (m, 2H), 
2.00 (s, 3H). K 9 

13C NMR (DMSO-de) * 196.3, 170.4, 152.2, 148.6, 130.0, 123.0, 111.8, 110.4, 65.2, 60.8, 65.5, 27.9, 26.3, 20.7. 

C. 4-(3-Acetoxypropoxy)-3-methoxy-6-nltroacetophenone 

[0561] 4-(3-Acetoxypropoxy)-3-memoxyacetophenone (3.99 g, 15 mmol) was added portionwlse to 15 ml of 70% 
HN0 3 in water bath and keep the reaction temperature at the room temperature. The reaction mixture was stirred at 
room temperature for 30 min and 30 g of crushed ice was added. This mixture was extracted with 100 ml of dichlo- 
romethane and the organic phase was washed with saturated sodium bicarbonate solution. The solution was dried 
over sodium sulfate and the solvent was removed in vacuo. The crude mixture was applied to a silica gel column with 
gradient methanol in methylene chloride to yield 3.8 g (81 .5%) of desired product 4-(3-acetoxypropoxy)-3-methoxy- 
6-nitroacetophenone and 0.38 g (8%) of ipso-substituted product 5-(3-acetoxypropoxy)-4-methoxy-1 ,2-dinitrobenzene. 
Side ipso-substituted product 5-(3-acetoxypropoxy)-4-methoxy-1 ,2-dinitrobenzene: 
Rp0.47 (dichloromethane/methanol, 99/1 ). 

UV (methanol), maximum: 334, 330, 270, 240, 212 nm; minimum: 310, 282, 263, 223 nm 

1 H NMR (CDCy 6 7.36 (s, 1H). 7.34 (s, 1H), 4.2B (t, 2H), 4.18 (t, 2H), 4.02 (s, 3H), 2.20 (m, 2H). 2.08 (s, 3H) 

1 *C NMR (CDQ3) 5 170.9, 152.2, 151.1. 117.6, 111.2, 107.9, 107.1, 66.7. 60.6, 56.9. 28.2, 20.9. 

Desired product 4-<3-acetoxypropoxy)-3-methoxy-6-nitroacetophenone: 

RpO. 29 (dichloromethane/methanol, 99/1). 

UV (methanol), maximum: 344, 300, 246, 213 nm; minimum: 320, 270, 227 nm. 
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1 H NMR (CDCl 3 ) 5 7.62 (s, 1 H), 6.74 (s, 1 H), 4.28 <t, 2H), 4.20 (t, 2H), 3.96 (s, 3H), 2.48 (s, 3H), 2.20 (m, 2H), 2.0B (s, 3H). 
1 *C NMR (CDCI3) 5 200.0, 171.0, 154.3, 148.8, 138.3, 133.0, 108.8. 108.0, 66.1, 60.8, 56.6, 30.4, 28.2, 20.9. 

D. 1-(4-(3-Hydroxypropoxy)-3-methoxy-6-nltrophenyl)ethanoI 

[0562] 4-(3-Acetoxypropoxy)-3-methoxy-6-nltroacetophenone (3.73 g, 1 2 mmol) was added 1 50 ml ethanol and 6.5 
g of K2CO3. The mixture was stirred at room temperature for 4h and TLC with 5% methanol fn dichloromethane indicated 
the completion of the reaction. To this same reaction mixture, it was added 3.5 g of NaBH 4 and the mixture was stirred 
at room temperature for 2h. Acetone (1 0 ml) was added to react with the remaining NaBH 4 . The solvent was removed 
in vacuo and the residue was uptaken into 50 g of silica gel. The silica gel mixture was applied on the top of a silica 
gel column with 5% methanol in methylene chloride to yield 3.15 g (97%) of desired product 1-(4-(3-hydroxypropoxy)- 
3-methoxy-6-nitrophenyf)ethanol, 

Intermediate product 4-(3-hydroxypropoxy)-3-methoxy-6-nitroacetophenone after deprotection: 
R,=0.60 (dichloromethane/methanoi, 95/5). 

Final product 1-(4-(3-hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanoi: 
R^O.50 (dichloromethane/methanoi, 95/5). 

UV (methanol), maximum: 344, 300, 243, 219 nm: minimum: 317, 264, 233 nm. 

'H NMR (DMSO-de) 5 7.54 (s, 1H), 7.36 (s, 1H), 5.47 (d, OH), 5.27 (m, 1H), 4.55 (t, OH), 4.05 (t, 2H) ( 3.90 (s, 3H), 
3.55 (q, 2H), 1.88 (m, 2H), 1.37 (d, 3H). 

13 C NMR (DMSO-de) 6 153.4, 146.4, 138.8, 137.9, 109.0, 108.1, 68.5, 65.9, 57.2, 56.0, 31.9, 29.6. 

E. 1-(4-(3-0-4,4 -Dlmethoxytritylpropoxy)-3-methoxy-6 -nitrophenyl)ethano! 

[0563] 1 -(4-(3-Hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanot (0.325 g, 1 .2 mmol) was co-evaporated with an- 
hydrous pyridine twice and dissolved in 15 ml anhydrous pyridine. The solution was cooled in ice-water bath and 450 
mg (1 .33 mmol) of DMTCI was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml 
methanol was added to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with 
toluene twice to remove trace of pyridine. The final residue was applied to a silica gel column with gradient methanol 
in methylene chloride containing drops of triethytamine to yield 605 mg (88%) of desired product 1 -(4-(3-0-4,4*-dimeth- 
oxytrityIpropoxy)-3-methoxy-6-nitrophenyl)ethanol. R, = 0.50 (dichloromethane/methanoi, 95/5). 
UV (methanol), maximum: 354, 302, 282, 274, 233, 209 nm; minimum: 322, 292, 263, 222 nm. 
1 H NMR (DMSO-dg) 8 7.54 (s, 1H), 6.8-7.4 (ArH), 5.48 (d, OH), 5.27 (m, 1H), 4.16 (t, 2H), 3.85 (s, 3H), 3.72 (s, 6H), 
3.15 (t, 2H), 1.98 (t, 2H). 1.37 (d, 3H). 

13C NMR(DMSO-d6)5157.8, 153.3, 146.1, 144.9, 138.7, 137.8, 135.7, 129.4, 128.7, 127.5, 127.4, 126.3, 112.9, 112.6, 
108.9, 108.2, 85.1, 65.7, 63.7, 59.2, 55.8, 54.8, 29.0, 25.0. 

F. 1-(4-(3-0-4,4'-Dlmethoxytrrrylpropoxy)-3-methoxy-6 -nttrophenyl)-1-0-((2-cyanoethoxy)- 
dilsopropytamlnophosphlno)ethane 

[0564] 1 -(4-(3-0-4,4 , -Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol (200 mg, 3.5 mmol) was dried under 
high vacuum and was dissolved in 15 ml of anhydrous methylene chloride. To this solution, it was added 0.5 ml N,N- 
diisopropylethyiamrne and 0.2 ml (0.89 mmol) of 2-cyanoethyl-N.N-diisopropylchlorophosphoramidite. The reaction 
mixture was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture 
was washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed 
in vacuo and a quick silica gel column with 1 % methanol in methylene chloride containing drops of triethytamine yield 
247 mg (91.3%) the desired phosphoramidite 1-(4-(3-0-4,4'-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-0-( 
(2-cyanoethoxy)-diisopropylaminophosphino)ethane. 
R,=0.87 (dichloromethane/methanoi, 99/1). 

EXAMPLE 36 
Oligonucleotide synthesis 

[0565] The oligonucleotide conjugates containing photocleavable linker were prepared by solid phase nucleic acid 
synthesis (see: Sinha et at. Tetrahedron Lett. 1 983, 24, 5843-5846; Sinha etal. Nucleic Acids Res. 1 984, 12, 4539-4557; 
Beaucage et af. Tetrahedron 1993. 49, 6123-61 94; and Matteucci et aL J. Am. Chem. Soc. 1981, 1CX3, 3185-31 91) 
under standard conditions. In addition a longer coupling time period was employed for the Incorporation of photocleav- 
able unit and the 5* terminal amino group. The coupling efficiency was detected by measuring the absorbance of 
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released DMT cation and the results indicated a comparable coupling efficiency of phosphoramkJite 1-(2-nitro-5-<3-0- 

M'-dimethoxytritylpropoxyJphenylH or 1 -(4-(3-0-4,4- 

dimethoxytritylpropoxy)-3-methoxy-6-nitro^ with 
those of common nucleoside phosphoramodites. Deprotection of the base protection and release of the conjugates 
from the solid support was carried out with concentrated ammonium at 55 °C overnight. Deprotection of the base 
protection of other conjugates was done by fast deprotection with AMA reagents. Purification of the MMT-on conjugates 
was done by HPLC (trityf-on) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonrtrile (5% to 25% 
In 20 minutes). The collected MMT or DMT protected conjugate was reduced in volume, detritylated with 80% aqueous 
acetic acid (40 min, 0 °C), desalted, stored at ~20*C. 

EXAMPLE 37 
Photolysis study 

[0566] In a typical case, 2 nmol of oligonucleotide conjugate containing photocleavable linker in 200 u.l distilled water 
was irradiated with a long wavelength UV lamp (Blak Ray XX-15 UV lamp, Ultraviolet products, San Gabriel, CA) at a 
distance of 10 cm (emission peak 365 nm, lamp intensity = 1.1 mW/cm 2 at a distance of 31 cm). The resulting mixture 
was analyzed by HPLC (trityf-off) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonrtrile. Analysis 
showed that the conjugate was cleaved from the linder within minutes upon UV irradiation. 

Equivalents 

[0567] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, 
numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the 
scope of this invention and are covered by the following claims. 
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SEQUENCE LISTING 



GENERA^ 'INFORMATION 

(i) APPLICANT: 

(A) NAME: SEQUENOM, INC. 

<B) STREET: 11555 Sorrento Valley Road 

(C) CITY: San Diego 

(D) STATE: California 
<E) COUNTRY: USA 

<F> POSTAL CODE (ZIP) : 92121 

( i ) INVENTOR/ APPLICANT : 

(A) NAME: Hubert Roster 

(B) STREET: 836 Via Mallorca Drive 
<C) CITY: La Jolla 

(D) STATE: California 

(D) COUNTRY: USA 

(EJ POSTAL CODE (ZIP) : 92037 

( i ) INVENTOR/ APPLI CANT : 

(A) NAME: Kai Tang 

<B> STREET: 8521 Summerdale Rd #241 

(C) CITY: San Diego 

(D) STATE: California 
<D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92X26 

(i) INVENTOR/ APPLICANT: 

(A) NAME: Zong-Jing Fu 

<B) STREET: 10615 Dabney Dr. #21 

(C) CITY: San Diego 
<D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92126 

(i) INVENTOR/APPLICANT : 

(A) NAME: Carsten W. Siegert 

(B) STREET: Geilstr. 42 

(C) CITY; 22303 Hamburg 

(D) STATE: 

(D) COUNTRY: Germany 

(E) POSTAL CODE (ZIP): 

( i ) INVENTOR/ APPLICANT : 

(A) NAME: Daniel P. Little 

(B) STREET: 393 Glendale Lake Rd. 

(C) CITY: Patton 

(D) STATE : Pennsylvania 
(D) COUNTRY: USA 

(S) POSTAL CODE (ZIP) : 18668 

(i) INVENTOR/ APPLI CANT: 

(A) NAME: G. Scott Higgins 

(B) STREET: Haselweg 1 

(C) CITY: 22880 Weidel 

(D) STATE: 

(D) COUNTRY: Germany 

<E) POSTAL CODE <ZIP) : 
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(i) INVENTOR /APPLICANT : 

(A) NAME: Andreas Braun 

(B> STREET: 13232 Bench! ey Road 

(C) CITY: San Diego 
<D) STATE: California 

(D) COUNTRY: USA 

<E) POSTAL CODE (ZIP) : 92130 

<i) INVENTOR /APPLICANT: 

(A) NAME: Brigitte Da mho f f er-Detnar 

(B) STREET: 3899 Raines St. #8-30B 

(C) CITY: San Diego 

(D) STATE: California 
(DJ COUNTRY: USA 

— (E) POSTAL CODE (ZIP) : 92109 

(i) INVENTOR /APPLICANT : 

(A) NAME: Christian Jurinke 

(B> STREET: Grope Hall 6 8 

(C) CITY: 22115 Hamburg 

(D) STATE : 

(D) COUNTRY: Germany 

<E> POSTAL CODE (ZIP) : 

( i ) INVENTOR /APPLICANT : 

(A) NAME : Dirk Van den Boom 

(B) STREET : For sthausst r . 8 
<C> CITY: 63 3303 Preiech 
<D) STATE: 

(D) COUNTRY: Germany 

(E) POSTAL CODE (ZIP) : 

( i) INVENTOR/ APPLI CANT : 

(A) NAME: Goubing Xiang 

(B) STREET: 113 81 Zapata Ave. 

(C) CITY: San Diego 

(D) STATE: California 
(D) COUNTRY: USA 

<E) POSTAL CODE (ZIP) : 92126 



(ii) TITLE OP THE INVENTION: DMA DIAGNOSTICS BASED ON MASS SPECTROMETRY 

(iii) NUMBER OP SEQUENCES: 320 

(iv) CORRESPONDENCE ADDRESS: 

<A) ADDRESSEE: Brown, Martin, Haller 6 Mcclain 

(B) STREET: 1660 Union Street 

<C) CITY: San Diego 

<D) STATE : CA 

<E) COUNTRY: USA 

<F> ZIP: 92101-2926 

<v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : PastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
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<B) FILING DATE: 06-NOV-1997 
(C) CLASSIFICATION: 

" ; lvIT) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : 

(B) FILING DATE: 10/08/97 

(vii) PRIOR APPLICATION DATA: 

{A) APPLICATION NUMBER: 08/933,792 
(B) FILING DATE; 09/19/97 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/787,639 

(B) FILING DATE: 01/23/97 

(vii) PRIOR APPLICATION DATA: 

(A> APPLICATION NUMBER: 09/785,988 
(B) FILING DATE: 01/23/97 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/746, 055 

(B) FILING DATE: 11/06/96 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/746,036 
(B> FILING DATE: 11/06/96 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER; 08/744,590 

(B) FILING DATE: 11/06/96 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 03/744,481 

(B) FILING DATE : 11/06/96 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Seidman, Stephanie L 

(B) REGISTRATION NUMBER: 33.779 

(C) REFERENCE/DOCKET NUMBER: 7352-2004PC 

(ix> TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-238-0999 

(B) TELEFAX : 619*238-0062 

(C) TELEX: 

(2> INFORMATION FOR SEQ ID NO:l: 

(i> SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: qDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRA&4SNT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NOsl: 
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GCAAGTGAAT CCTGAGCGTC 
■ -■'*■:: S- (2) INFORMATION FOR SEQ ID NO:2: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
GTGTGAAGGG TTCATATGC 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
<b) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
AT CT AT ATT C ATCATAGGAA ACACCACA 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
O) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
tv) FRAGMENT TYPE: 
<vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID !JO:4: 

GTATCTATAT TCATCATAGG AAACACCATT 

(2) INFORMATION FOR SEQ ID !K- 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

(E) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

""(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : HO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION; SEQ ID NO: 5 
GCTTTGGGGC ATGGACATTG ACCCGTATAA 



(2) INFORMATION FOR SSQ ID NO: 6: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
Civ) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGACTACTA ATTCCCTGGA TGCT3GGTCT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 
(C> STRANDEDNESS: Single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TTGCCTGAGT GCAGTATGGT 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(O STRANDEDNESS: Single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 
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(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:fl: 
AGCTCTATAT CGGGAAGCCT 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTGTGCCACG CGGTTGGGAA TGTA 



<2) INFORMATION FOR SEQ ID NO: 10: 

(i> SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 
AG CAACG ACT GTTTGCCCGC CAGTTG 



(2) INFORMATION FOR SEQ ID NO: 11: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 11: 
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TACATTCCCA ACCGCQTGGC ACAAC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANT I SENSE : NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 12 
AACTGGCGGG CAAACAGTCG TTGCT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
.(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

{xi) SEQUENCE DESCRIPTION: SBQIDNO;13: 
GCAAGTGAAT CCTGAGCGTG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 14: 
GTGTGAAGGG CGTG 

(2> INFORMATION FOR SEQ ID NO: IS 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 24 base pairs 
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(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iV> ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS 

CTATATTCAT CATAGGAAAC ACCA 

(2 ) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTCACCCTCG ACCTGCAG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTGTAAAACG ACGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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<v) FRAGMENT TYPE : 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CTTCCACCGC G ATGTTGA 

(2) INFORMATION FOR SEQ ID N0:19: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : unknown 

<ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAGGAAACAG CTATOAC 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID il 
GTCACCCTCG ACCTGCAGC 
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(2) INFORMATION FOR SEQ ID NO: 22: 

^U) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(O) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 
(iv> ANTI SENSE : NO 

(v) FRAGMENT TYPE: 
(vl) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOi22 
GTTGTAAAAC GAOGGCCAGT 

<2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
CC) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TCTACGTCAC AACT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
..-..iiv) ANTISENSE: NO 

" * (v) FRAGMENT TYPE : 
(vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCT G CT GCTGGATGAT 60 
2 AGATCTGG 

(2) INFORMATION FOR SEQ ID NO: 26: 



w CCGACGCATC AGATCTGG 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH s 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTGATGCGTC GGATCATC 18 
(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
ID) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GATGAT CCGA CGCATCACAG CTC 23 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ -ID NO:2B 
TCQZ~ CCAA GAGCTGTGAT GCGTCGGATC ATC 

(2) INFORMATION FOR SEQ ID N0:29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPEr nucleic acid 

(C) STRAND EDNESS : single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GATGATCCGA CGCATCACAG CTC 

<2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

(ii) MOLECULZ TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
liv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(P) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(iv> ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE r 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TCGGTTCCAA GAGCT 

(2) INFORMATION FOR SEQ ID NO : 3 1 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
r-rr-i,-- (C) STRANDEDNES5 : single 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISSNSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
TCGGTTCCAA GAGCT 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE ; NO 
(v> FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CATTTGCTTC TGACACAACT G 

(2) INFORMATION FOR SEQ ID NOi33; 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISSNSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTTCTCTGTC TCCACATGC 

(2) INFORMATION FOR SEQ ID NO: 34; 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 
(v> FRAGMENT TYPE: 
n*7»(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
TGCACCTGAC TC 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 35; 
TGCTTACTTA ACCCAGTGTG 



<2) INFORMATION FOR SEQ ID NO; 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
ID) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE ; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CACACTATGT AATACTATGC 

<2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO 37: 
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GAAAATATCT GACAAACTCA TC 22 

(2) INFORMATION FOR SEQ ID NO: 38: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY : unknown 

10 (ii> MOLECULE TYPE: cDNA 

Ciii) HYPOTHETICAL: NO 
(iv> ANT I SENSE : NO 
(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

15 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CATGGACACC AAATTAAGTT C 21 



20 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 39: 
<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 
ID) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii> HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TGAGACTCTG TCTC 14 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS , base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
fiv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Ui) SEQUENCE DESCRIPTION : SEQ ID NO: 40: 
TTCCCCAAAT CCCTG 15 
(2> INFORMATION FOR SEQ ID NO: 41: 

50 

(i) SEQUENCE CHARACTERISTICS: 
tA) LENGTH: 19 base pairs 



55 



40 



45 
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(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 
* <D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv> ANTI SENSE : NO 

(v) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 41 
GGCACGGCTG TCCAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

CO STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

AGGCCGCGCT CGGCGCCCTC 

(2) INFORMATION FOR SEQ ID NO: 43: 

<i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 43; 

GCGGACATGG AGGACGTG 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i> SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 



112 



EP 1 164 203 A2 



(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

: ' ^<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
GATGCCOATG ACCTGCAGAA G 

U> INFORMATION FOR SEQ ID NO: 45 : 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv> ANTI SENSE : NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 45 
CCCTTACCCT TACCCTTACC CTAA 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

Ui) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:46; 
AATCCGTGCA GCAGAGTT 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



TGTCAGAGCT GGACAAGTGT 
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» ^ 



(2) INFORMATION FOR SEQ ID NO: 48 

SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH : 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEONESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
w (iii) HYPOTHETICAL: NO 

(iv> ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



15 



— (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 43: 
GATATTGTCT TCCCGGTAGC 2 0 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNBSS : single 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
25 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

30 

CTCGGACCAG GTGTACCQCC 20 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQ UENCE CHARACTERISTICS: 
35 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : single 
(O) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
40 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

45 

CCTGTACTGG AAGGCGATCT C 21 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
so (&) LENGTH*. 20 base pairs 

<B> TYPE: nucleic acid 
<C) STRANDEDNBSS: single 

(D) TOPOLOGY: unknown 

55 
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(ii> MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL. : NO 

(iv) ANTISENSE: NO 
5 i-iV) FRAGMENT TYPE; 

(vi> ORIGINAL SOURCE: 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 51s 
CATGAGGCAG AGCATACGCA 20 

10 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

13 "' (C) STRANDEDNBSS : single 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
20 (V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52; 
GACAGCAGCA CCGAGACGAT 20 

25 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH : 20 base pairs 
(B) TYPE: nucleic acid 

30 <C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
35 (V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 53: 
CGGCTGCGAT CACCGTGCGG 20 
(2> INFORMATION FOR SEQ ID NO:54: 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
<B) TYPE: nucleic acid 

45 (C) STRANDBDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : CDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

50 (v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID Tl "> S4 
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GAT CCACTGT GCGACGAGC „ 19 
(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

10 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

15 <xi) SEQUENCE DESCRIPTION: SEQ ID NO:5S: 

GCGGCTGCGA TCACCGTGC 19 

(2) INFORMATION FOR SEQ ID NO:56: . 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

25 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 
<v) FRAGMENT TYP2 : 
(vi) ORIGINAL SOURCE: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

TGCACCTGAC TC 12 

(2) INFORMATION FOR SEQ ID NO: 57: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

40 (ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NOr57: 

CTGTGGTCGT GC 12 

(2) INFORMATION FOR SEQ ID NO: 58: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

55 
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(D) TOPOLOGY: unknown 

^ Mi) MOLECULE TYPE: CDNA 
"(iii) HYPOTHETICAL: NO 
5 (iv) ANT I SENSE : NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

10 GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGCGC 39 

(2) INFORMATION FOR SEQ ID NO: 59: 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
' 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE : CDNA 
{iii) HYPOTHETICAL: NO 
20 (iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
25 TCTCTGTCTC CACATGCCCA G 21 

(2) INFORMATION FOR SEQ 12 NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 
30 (B) TYPE: nucleic acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
35 (iv) ANT I SENSE : NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

40 ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 

C 61 

(2) INFORMATION FOR SEQ ID N0:61: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

^Jci) SEQUENCE DESCRIPTION: SEQ ID NO: 61; 

5 AACTAAGCCA TGTGCACAAC A 21 

(2) INFORMATION FOR SEQ ID NO : 62 i 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH : 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

" t i i ) MOLECULE TYPE: RNA 
15 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
20 UCCGGUCUGA UGAOUCCGUG AGGAC 25 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
2S (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GUCACUACAG GUGAGCUCCA 20 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 

40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 
43 (iv) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
30 CCAUGCGAGA GUAAGUAGUA 20 

(2) INFORMATION FOR SEQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 50 base pairs 
-.- y . .-(B) TYPE: nucleic acid 
5 (C) STRANDEDNESS : single 

<D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE : RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
10 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:65: 
AGGCCUSCGG CAAGACGGAA AOACCAUGGU CCCUNAUCUG CCGCAGGAUC 50 
15 <2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
2V (C> STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

25 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CATTTGCTTC TGACACAACT 20 

30 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 21 base pairs 

(B) TYPE : nucleic acid 
3S (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
TCTCTGTCTC CACATGCCCA G 21 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 
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10 



(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 
FRAGMENT TYPB: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
GTCGTCCCAT GGTGCACCTG ACTC 24 
(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 22 base pairs 

(B) TYPE : nucleic acid 
-(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

15 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CGCTGTGGTG AGGCCCTGGG CA 22 
(2) INFORMATION FOR SEQ ID NO: 70: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

So 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPB: 

(vi) ORIGINAL SOURCE: 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
GACGACGACT GCTACCTGAC TCCA 24 
(2) INFORMATION FOR SEQ ID N0:71: 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID :*o 
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ACAGCGGACT GCTACCTGAC TCCA 24 
... J2) INFORMATION FOR SEQ ID NO : 72 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
CD) TOPOLOGY: unknown 



10 



15 



<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 
-(.vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 72 : 

TGGAGTCAGG TAGCAGTC 18 

<2> INFORMATION FOR SEQ ID NO:73: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

25 Cii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL! NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 

30 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

CAGCTCT CAT TTTCCATAC 19 
(2) INFORMATION FOR SEQ ID NO:74: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 
<D) TOPOLOGY: unknown 

40 Cii) MOLECULE TYPE: CDNA 

Ciii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

AGCCCCAAGA TGACTATC 18 
(2) INFORMATION FOR SEQ ID NO: 75: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

AC- 

5 <ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA©1ENT TYPE: 

(vi) ORIGINAL SOURCE : 

w (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

COAGGAGCTC AAGGCCAGAA T 21 
(2) INFORMATION FOR SEQ ID NO: 76: 

/5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 
(±V) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CAGGGGCAGC TCAGCTCTC 19 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
35 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

40 

GGCACGGCTG TCCAAGGA 18 
(2) INFORMATION FOR SEQ ID NO: 78 : 

(i) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
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<vi) ORIGINAL SOURCE: 
. SEQUENCE DESCRIPTION: SEQ ID NO; 78 

5 

AGGCCGCGCT CGGCGCCCTC 20 

(2) INFORMATION FOR SEQ ID NO: 79: 

10 <i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 18 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

— "Si 

,5 (ii) MOLECULE TYPE : cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

20 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

CTTACTTGAA TTCCAAGAGC 20 

(2) INFORMATION FOR SEQ ID NO: 80: 

25 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 22 base pairs 
(B) TYPE: nucleic acid 
(CI STRANDEDNESS: single 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

( v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

33 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

GGOCTGACTT GCATGGACCG GA 22 

(2) INFORMATION FOR SEQ ID NO: 81 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

45 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

50 (Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 81: 

AGCCAGGACA AG 12 
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(2) INFORMATION FOR SEQ ID NO: 82: 

/^A* SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
10 (iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xd) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
15 ACAGCAGGAA CAGCA 15 

(2) INFORMATION FOR SEQ ID NO: 83 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 

20 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
25 (iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
30 GCGGACATGG AGGACGTG 18 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
40 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

45 GATGCCGATG ACCTGCAGAA G 21 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
so (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii> MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 
,' iv) ANTI SENSE : NO 
~(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85 
GTGCCCTGCA GCTTCACTGA AGAC 



(2) INFORMATION FOR SEQ ID NO: 86: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY! unknown 

(ii) MOLECULE TYPE: CDMA 

(iii) HYPOTHETICAL: NO 
(iv> ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
AGCCAGGACA AG 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 • base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 
AGCCAGGACA AGTC 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO : B 8 : 
AGC CAG C"/** AGA 13 

5 

(2) INFORMATION FOR SEQ ID NO: 89 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 

10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE : cDNA 
-Ciii) HYPOTHETICAL: NO 

15 (iv) ANTISENSE: NO 

(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
ACAGCACCAA CAGCA 15 

20 

(2) INFORMATION FOR SEQ ID NO: 90: 

♦ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: l? base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDKA 

(iii) HYPOTHETICAL: NO 
30 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
35 ACAGCAGGAA CAGCATC 17 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 16 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
45 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 



50 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

CAGCAG 16 
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(2) INFORMATION FOR SEQ ID NO: 92: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GCGGACATGG AGGACGTG 



(2) INFORMATION FOR SEQ ID NO: 93: " 

, n (i> SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
25 (iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

30 

GCGGACATGG AGGACGTGGC 20 
(2) INFORMATION FOR SEQ ID NO: 94: 



(i) 

35 (A> LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
40 (iii) HYPOTHETICAL: NO 

(iv) ANT IS ENS E : NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

45 

GCGGACATGG AGGACGTGC 19 
(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHARACTERISTICS: 

so (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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CD) TOPOLOGY: unknown 

- <CV) MOLECULE TYPE: CDNA 
5 (iti) HYPOTHETICAL: NO 

Uv) AKTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95; 
10 GATGCCGATG ACCTGCAGAA G 21 

(2) INFORMATION FOR SEQ ID NO: 96: 

15 (i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS ; single 
<D) TOPOLOGY: unknown 

2 0 <ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

GATGCCGATG ACCTGCAGAA GC 22 
(2) INFORMATION FOR SEQ ID NO: 97: 

__ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
35 (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

40 

GATGCCGATG ACCTGCAGAA GTG 23 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

_ rt (ii) MOLECULE TYPE: cDNA 

30 (iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL. SOURCE: 
-- 1L) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GTGCCCTGCA GCTTCACTOA AGAC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

1li) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTGCCCTGCA GCTTCACTOA AGACTG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE : 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 
GTGCCCTGCA GCTTCACTOA AGACC 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
TATCTGTTCA CTTGTGCCC 
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(2) INFORMATION FOR SEQ ID NO; 102: 

r- M) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1* base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE: NO 
( v> FRACH^ENT TYPE: 
(vi) ORIGINAL SOURCE: 

~(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

CAGAGGCCTG GGGACCCTO 19 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
<B) TYPE: nucleic acid 

(C> STRAND EDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO . 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION SEQ ID NO: 103 : 



30 



ACGACAGGGC TGGTTGCC 



18 



(2) INFORMATION FOR SEQ ID NO: 104: 



35 




40 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



45 



ACTGACAACC ACCCTTAAC 



19 



<2) INFORMATION FOR SEQ ID NO: 10^ 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: KO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 105: 
CTGCTTGCCA CAGGTCTC 18 



- (2) INFORMATION FOR SEO ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSB : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 
CACAGCAGGC CAGTGTGC 18 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

Cxi) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRACWBNT TYPE : 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GGACCTGATT TCCTTACTG 19 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 
, ,/.yV. FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE i 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TGAATCTGAG GCATAACTG 19 



t0 (2) INFORMATION FOR SEQ ID NO : 109: 



(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 50 base pairs 
--4B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE ; NO 

(v) FRAGMENT TYPE: 

20 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TTGCGTACAC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT SO 

25 

(2) INFORMATION FOR SEQ ID N0:110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
35 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
GTAAAACGAC GGCCAGTGTG TACGCAA 27 

40 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
50 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TACTGOAAGG CGATCTCAGC AATCAGC 27 
5 (2) INFORMATION FOR SEQ ID NO : 112 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
-<iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
15 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

GGCACGGCTG TCCAAGOAG 19 

20 (189) INFORMATION FOR SEQ ID NO: 113: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPK: nucleic acid 

(C) STRANDEDNESS: single 
2S (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

30 (vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
AGGCCGCGCT CGGCGCCCTC 20 

35 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 
45 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 114: 
GUCACUACAO GUGAOCUCCA 20 

50 

(2) INFORMATION FOR SEQ ID NO: 115: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 
•--'\'3J TYPE: nucleic acid 

5 (C) STRANDEDNBSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 
10 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GAATTCGAGC TCGGTACCCG G 21 

15 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS : 
<A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
2o (C> STRANDEDNBSS: single 

<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 
25 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
CCGGGTACCG AGCTCGAATT C 21 

30 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
CCTCTTGGGA ACTGTGTAGT ATT 23 
45 (2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: unknown 

• (ii) MOLECULE TYPE: cDNA 
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<iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 
<>;.) FRAGMENT TYPE; 
5 "ivi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

AGGCTGTCTC TCTCCCTCTC TCATACACAC ACACACACAC ACACACACAC ACACACACAC 60 
ACACACACAC TCACACTCAC CCACANNNAA ATACTACACA GTTCC CAAGA GO 112 

10 

<2) INFORMATION FOR SEQ ID NO: 119; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL; NO 
(iv) ANT I SENSE : NO 

20 { V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATAC 49 

25 

(2) INFORMATION FOR SEQ ID NO: 12 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
35 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATACA CACACACACA 60 
CACACACACA CACACACACA CACACACACA CACTCACACT CACCCACANN NAAATACTAC 120 
ACAGTTCCCA AGAGG 13S 

<2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
33 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 121 
AATACT.-.C"' > AG 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
Civ) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 
CTGATGCGTC GGATCATCTT TTTT 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown. 

(ii) MOLECULE TYPE: CDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 
GATGATC CGA CGCATCAGAA TGT 

(2) INFORMATION FOR SEQ ID NO: 124: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
GATCTAGCTG GGCCGAGCTA GGCCGTTGA 

(2) INFORMATION FOR SEQ ID NO: 12 5: 
U) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

■ :_ -{C> STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 
CTGATGCGTC OGATCATCTT TTTTTTT 

(2) INFORMATION FOR SEQ ID NO : 126: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
GATGATCCGA CG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
GATGATCCGA CGCAT 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

!: "i ) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 128: 
AAAAAAGATC AT 12 
<2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
'(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
GATCCGACGC AT 12 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 253 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

GGCACGGCTG TCCAAGGAGC TGCAGGCGGC GCAGGCCCGG CTGGGCGCGG ACATGGAGGA 60 
CGTGTGCGGC CGCCTGGTGC AGTACCGCGG CGAGGTGCAG GCCATGCTCG GCCAGAGCAC 120 
CGAGGAGCTG CGGGTGCGCC TCGCCTCCCA CCTGCGCAAG CTGCGTAAGC GGCTCCTCCG 180 
CGATGCCGAT GACCTGCAGA AGTGCCTGGC AGTGTACCAG GCCGGGGCCC GCGAGGGCGC 240 
CGAGCGCGGC CTC 253 

(2) INFORMATION FOR SEQ ID NO: 131: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
liii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
G AATT AC a T3\ CCCAACCGCO TGGCACAACA ACTGGCGGGC AAACAGTCGT TGCTGATT 58 

5 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 57 base pairs 
<B> TYPE: nucleic acid 

10 (c) strandednesS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
t-i-v) ANTISENSE: NO 

13 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
ACCATTAAAO AAAATATCAT CTTTGGTGTT TCCTATGATG AATATAGAAG CGTCATC 57 

20 

<2) INFORMATION FOR SEQ ID NO:133: 

(i) SEQUE NCE CHARACTERISTICS: 
<A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
25 (C> STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
{iv) ANTISENSE : NO 

SO (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
CT AT ATT CAT CATAGOAAAC ACCAAAGAT 29 

35 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
45 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CTATATTCAT CATAGGAAAC ACCAAT 26 

50 

(2) INFORMATION FOR SEQ ID N0:135: 
(i) SEQUENCE CHARACTERISTICS: 

55 
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(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
..... - ,'C) STRANDEDNESS: single 

"(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 
CTATATTCAT CATAGGAAAC ACCAAAGAT 

(2) INFORMATION FOR SEO ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 
CTATATTCAT CATAGGAAAC ACCAAAGATG ATATTTTC 

(2) INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 
CTATATTCAT CATAGGAAAC ACCAATG ATATTTTC 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 35 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CTATATTCAT CATAGGAAAC ACCAAAGATA TTTTC 

(2) INFORMATION FOR SEQ ID NO:139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 
... (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 : 
CTATATTCAT CATAGGAAAC ACCAAAGATG C 



<2) INFORMATION FOR SEQ ID NO:140: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 200 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 : 

CTTCCACCGC GATGTTGATG ATTATGTGTC TGAATTTGAT GGGGGCAGGC GGCCCCCGTC 
TGTTTOTCGC QGGTCTGGTG TTGATGGTGG TTTCCTGCCT TQTCACCCTC GACCTGCAOC 
CCAAGCTTGO GATCCACCAC CATCACCATC ACTAATAATG CATGGGCTGC AGCCAATTGG 
CACTGGCCGT CGTTTTACAA 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

GTCACCTTT^ ACCTGCAGCC CAAGCTTGGG ATCCACCACC ATCACCATCA CTAATAATGC 
5 ATGGGCTGCA GCCAATTGGC ACTGGCCGTC OTTTTACAA 

(2) INFORMATION FOR SEQ ID NO:142: 

(i) SEQUENCE CHARACTERISTICS : 
<A> LENGTH: 15 base pairs 

10 (B) TYPE: nucleic acid 

(C) STRANDED NESS : single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
fiii) HYPOTHETICAL: NO 

15 <iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 2: 

20 TGTACGTCAC AACTA 15 

(2) INFORMATION FOR SEQ ID NO: 143: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
30 (iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
(vl) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
35 TGTACGTCAC AACTAC 16 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQU ENCE CHARACTERISTICS : 
(A) LENGTH: 17 base pairs 

40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 
45 (iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE r 

(xi) SEQUENCE DESCRIPTION: SEQ ID tao 144: 

50 TGTACGTCAC AACTACA 17 

(2) INFORMATION FOR SEQ ID NO:14b. 
(i> SEQUENCE CHARACTERISTICS: 

55 
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(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
_-.JC) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMEKT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEO ID NO: 145 
TGTACGTCAC AACTACAA 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 
TGTACGTCAC AACTACAAT 

(2) INFORMATION FOR SEQ ID NO:147: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B> TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<tii) HYPOTHETICAL: NO 
(iv) ANTISENSE : NO 

(V) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 : 
TGTACGTCAC AACTACAATA 

(2> INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL r NO 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 
TGTACOTCAC AACTACAATA <3 

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
- (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 
TGTACOTCAC AACTACAATA GG 

(2) INFORMATION FOR SEQ ID NO: ISO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length-. 23 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 
TGTACGTCAC AACTACAATA GGC 

(2) INFORMATION FOR SEQ ID NO: 151: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE; nucleic acid 
(C> STRANDEDNESS: single 
(D) topology: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
TGTACGTCAC AACTACAATA GGCC 
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(2) INFORMATION FOR SEQ ID NO:l52: 

. „'\) SEQUENCE CHARACTERISTICS; 
(A} LENGTH : 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

-fxi) SEQUENCE DESCRIPTION: SEQ ID NO.-152 
TGTACGTCAC AACTACAATA GGCCC 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 
TGTACGTCAC AACTACAATA GGCCCT 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 154 

TGTACGTCAC AACTACAATA GGCCCTG 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
(BJ TYPE-, nucleic acid 

(C) STRAND EDNESS : single 
(D| TOPOLOGY: unknown 
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(ii) MOLECULE TYPEr cDNA 

(iii) HYPOTHETICAL: NO 
. N.v) ANTISENSE: NO 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE : 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS 5 

TGTACGTCAC AACTACAATA GGCCCTGC 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 
— (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:lS6 
TGTACGTCAC AACTACAATA GGCCCTGCA 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:157 
TGTACGTCAC AACTACAATA GGCCCTGCAC 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(XV) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 
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TGTACGTCAC AACTACAATA GGCCCTGCAC C 

. ,.. v:: A2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
-fcvi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

TGTACGTCAC AACTACAATA GGCCCTGCAC CA 

(2) INFORMATION FOR SEQ ID NO:160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : . cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16D 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAG 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGG 

(2) INFORMATION FOR SEQ ID NO: 16^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE; nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 

(iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGC 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGC C 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 37 base pairs 
(B) TYPE: nucleic acid 

<C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 164 : 
TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCA 

(2) INFORMATION FOR SEQ ID NO:16S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 
<v) FRAGMENT TYPE: 
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(vi> ORIGINAL SOURCE: 



'r*> SEQUENCE DESCRIPTION: SEQ ID NO:l65: 



5 



TGTACOTCAC AACTACAATA GGCCCTGCAC CAGGCCAO 



38 



(2) INFORMATION FOR SEQ ID NO : 166; 



10 




13 



-<ii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 : 



20 



TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAGA 



39 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 
(a> LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 



(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 8: 
CTGATGCGTC GGATCATCCA 20 



35 



CTGATGCGTC GGATCATCC 



19 



(2) INFORMATION FOR SEQ ID NO: 169: 



55 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 21 base pairs 
-_.!3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

( v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 169 
CTGATGCGTC GGATCATCCA G 

(2) INFORMATION FOR SEQ ID NO:170: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 22 base pairs 

(B) TYPE-, nucleic acid 

(C) STRAKDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 
CTGATGCGTC GGATCATCCA GC 

(2) INFORMATION FOR SEQ ID NO:171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE : nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 
CTGATGCGTC GGATCATCCA GCA 

(2) INFORMATION FOR SEQ ID NO:172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 
J.vl) FRAGMENT TYPE: 

ivi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 
CTGATGCGTC GGATCATCCA GCAG 

(2) INFORMATION FOR SEQ ID NO: 173 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 
--. (C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:173 
CTGATGCGTC GGATCATCCA GCAGC 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 174 
CTGATGCGTC GGATCATCCA GCAGCA 

(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL-. NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
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CTGATCCGTC GGATCATCCA GCAGCAG 

(2) INFORMATION FOR SEQ ID NO:l7€: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS ; single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:I76 

CTGATCCGTC GGATCATCCA GCAGCAGC 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 7 
CTOATGCGTC GGATCATCCA GCAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID MO: 178: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO: IT* 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY .- unknown 

'■ ■* v ; : il> MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAOffiNT TYPE : 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SBQ ID NO: 179 
CTGATGCGTC GGATCATCCA GCAGCAOCAG C 

(2) INFORMATION FOR SEQ ID NO-.180: 

Ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: CDNA 
(iii> HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-1B0 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CA 

(2) INFORMATION FOR SEQ ID NO:l8l: 

(i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 181: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAG 

<2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v> FRAGMENT TYPE: 
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<vi> ORIGINAL SOURCE: 
... (xt) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

5 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGC 34 

(2) INFORMATION FOR SEQ ID NO:lB3: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY : unknown 

Ui) MOLECULE TYPE: cDNA 
15 (iii> HYPOTHETICAL: NO 

(iv) ANT I SENS B : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCA 35 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C> STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
J0 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 



35 



40 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:184: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAG 36 
(2) INFORMATION FOR SEQ ID NO: 185 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

45 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IBS: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGC 37 
(2) INFORMATION FOR SEQ ID NO: 186: 



55 
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<i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 38 base pairs 
<fi) TYPE: nucleic acid 

'"* **" <C) STRANDEDNESS: Single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 
CTOATGCpTC GGATCATCCA GCAGCAGCAG CAGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) -ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG 
(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG C 
(2) INFORMATION FOR SEQ ID NO:189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 
(y\ FRAGMENT TYPE: 
ivi) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION -. SEQ ID NO: 189 : 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC A 
(2) INFORMATION FOR SEQ ID NO: 190; 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
...(C) strandednbss : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANT I SENSE : NO 

(v) FRAffl^ENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 190 : 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AG 
(2) INFORMATION FOR SEQ ID NO* 191: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGT 
(2> INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 
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CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTC 
,.^<2) INFORMATION FOR SEQ ID NO: 193 1 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEBNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 
_{vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 193: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCA 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

<ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL; NO 
(iv> ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCAC 
<2) INFORMATION FOR SEQ ID NO:l95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTC AC G 
(2) INFORMATION FOR SEQ ID NO: 196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDKESS: single 
(D> TOPOLOGY : unknown 

t£i) MOLECULE TYPE: cDNA 
<ii±) HYPOTHETICAL: NO 
Civ) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGC 
(2) INFORMATION FOR SEQ ID NO: 197: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
<iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 : 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCT 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEENBSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

tvi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA A 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNBSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 : 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA AC 
(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: S3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANTISENSE z NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACC 
(2) INFORMATION FOR SEQ ID NO:202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) type r nucleic acid 
<C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCQ 
(2) INFORMATION FOR SEQ ID NO: 203 : 
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Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 55 base pairs 
. JB) TYPE: nucleic acid 
'(C) STRANDEDNBSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTISBNSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGA 
(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND ED NESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
{iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID SO: 2 04: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAA 
(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE ; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ IIX NO: 205 : 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAAT 
(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 58 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 
v fv) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO; 206 : 



CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATC 
(2) INFORMATION FOR SEQ ID NO:207: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS; single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANT IS ENS B : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCC 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY x unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTZ SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 208: 
CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209; 
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CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
T 61 

5 "(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

io <d) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
<iv} ANTISENSE : NO 

-( v) FRAGMENT TYPE: 
15 (vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:210: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TG 62 

SO 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
30 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
35 1X30 €3 

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 64 base pairs 

40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
45 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 2 12: 

SO CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC ^;r?CACCCTA ACCGAATCCC 60 

TGGT 64 

(2) INFORMATION FOR SEQ ID NO:213 

55 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 base .pairs 

. . v - r .«.(B) TYPE: nucleic acid 
5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 
10 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
15 TGGTC $ 5 

(2) INFORMATION FOR SEQ ID NO:214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRAND ED KESS : single 

(D) TOPOLOGY: Unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
25 (iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

30 CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 

TGGTCA £5 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : ainole 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: CDNA 
40 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 



45 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAG 67 

(2) INFORMATION FOR SEQ ID NO: 216: 

so (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

55 
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10 



(D) TOPOLOGY; unknown 

*A A ) MOLECULE TYPE: CDNA 
(lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) .FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCOAATCCC 60 
TGGTCAGA $6 

(2) INFORMATION FOR SEQ ID NO:217: 

1S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

<C> S TRANDEDNE S 3 : single 
(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCOAATCCC 60 
TGGTCAGAT 69 



30 



3S 



40 



(2) INFORMATION FOR SEQ ID NO:218: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 70 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI5ENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 
TGGTCAGATC 70 

45 (2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
so <D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

55 / 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
TGGTCAGATC T 

(2) INFORMATION FOR SEQ ID NO .-220: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) topology: unknown 

(ii> MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 
TGGTCAGATC TT 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
TGCACCTGAC TCC 

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO .222: 
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TG CACCTG AC TCCT 

(2) INFORMATION FOR SEQ ID NO: 223: 

- - * 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS ; single 

(D) TOPOLOGY i unknown 

(ii) MOLECULE TYPE; CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 
<vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223 
TGCACCTGAC TCCTG 

(2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pair 8 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE. NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 
TGCACCTGAC TCCTGT 

(2) INFORMATION FOR SEQ ID NO:22S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 
TGCACCTOAC TCCTGTG 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

5 vi±) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

10 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 226 : 

TGCACCTGAC TCCTGTGG 18 

(2) INFORMATION FOR SEQ ID NO: 227: 

15 "(±) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:227r 

TGCACCTGAC TCCTGTGGA 19 
(2) INFORMATION FOR SEQ ID NO: 22 8: 
30 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 
(8) TYPE j nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

35 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

^ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

TGCACCTGAC TCCTGTGGAG 20 

(2) INFORMATION FOR SEQ ID NO:229: 

45 (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

50 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

55 



167 



BP 1 164 203 A2 



(Vi) ORIGINAL SOURCE: 

(fTi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 
TQCACCTGAC TCCTGTGGAG A 

(2) INFORMATION FOR SEQ ID NO: 2 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : unknown 

-(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 
TQCACCTGAC TCCTGTGGAG AA 

(2) INFORMATION FOR SEQ ID NO:23l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii> HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:231: 
TGCACCTGAC TCCTGTGGAG AAG 

<2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI3ENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID N0:232: 
TGCACCTGAC TCCTGTGGAG AAGT 

(2) INFORMATION FOR SEQ ID NO:233r 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
,<B) TYPE: nucleic acid 

" (C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

TGCACCTGAC TCCTGTGGAG AAGTC 

(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : Single 
<D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 
TGCACCTGAC TCCTGTGGAG AAGTCT 

(2) INFORMATION FOR SEQ ID NO:235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 
TGCACCTGAC TCCTGTGGAG AAGTCTG 

(2) INFORMATION FOR SEQ ID NO: 23 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 
FRAGMENT TYPE: 

(vi) ORIGINAL. SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:236 
TGCACCTGAC TCCTGTGGAO AAGTCTGC 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 29 base pairs 

(B) TYPE: nucleic acid 
" (C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(XV) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:237 
TGCACCTGAC TCCTGTGGAO AAGTCTGC C 

(2) INFORMATION FOR SEQ ID NO:238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE* DESCRIPTION: SEQ ID NO:239: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG 

(2) INFORMATION FOR SEQ ID NO:239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239. 
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TGCACCTGAC TCCTGTGGAG AAGTCTGCCG T 

(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii> HYPOTHETICAL; NO 
(iv) ANTI SENSE : NO 

(v> FRAGMENT TYPE: 
„tvi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TT 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 33 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTA 

<2) INFORMATION FOR SEQ ID NO: 242 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTAC 

(2) INFORMATION FOR SEQ ID NO: 24 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) topology: unknown 

H i ) MOLECULE TYPE : . cDNA 
(iii> HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACT 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQU ENCE CHARACTERISTICS: 
(A) LENGTH : 36 base pairs 
(BJ TYPE; nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY r unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTG 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUE NCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTG C 

(2) INFORMATION FOR SEQ ID NO:246 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 

u 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCC 

(2) INFORMATION FOR SEQ ID NO: 24 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

Ui> MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTISENSB: NO 
<V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE z 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCC 
(2) INFORMATION FOR SEQ ID NO: 24 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOrOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT 
(2) INFORMATION FOR SEQ ID NO:249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 9: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT G 
(2) INFORMATION FOR SEQ ID NO: 250: 
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<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 
>. r'J*) TYPE : nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
10 <v) FRAGMENT TYPE: 

(Vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:250; 
TGCACCTGAG TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GT 42 

15 

(2) INFORMATION FOR SEQ ID NO:251: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH : 43 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS ; single 

(DI TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
25 (V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTG 43 
30 (2) INFORMATION FOR SEQ ID NO;252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE : nucleic acid 
33 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
40 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGG 44 
45 (2) INFORMATION FOR SEQ ID NO: 2 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE J cDNA 

55 
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(iii) HYPOTHETICAL; NO 

(iv) ANT I SENSE : WO 
v .'.y.) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;253: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGG 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 
-- (C> STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
TGCACCTGAC TCCTGTOGAG AAGTCTGCCG TTACTGCCCT GTGGGG 
(2) INFORMATION FOR SEQ ID NO:25S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB.: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGC 
(2) INFORMATION FOR SEQ ID NO:2S6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID N0 2S6: 
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TGCACCTGAC TCCTGTGGAG AAGT.CTGCCG TTACTGCCCT GTGGGGCA 
A2) INFORMATION FOR SEQ ID NO:257: 

. - --7 " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 
ly±) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAA 
(2) INFORMATION FOR SEQ ID NO:258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25B: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG 
(2) INFORMATION FOR SEQ ID NO:259: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 59: 
TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG G 
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

5 \xi) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

10 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG GT 52 
(2) INFORMATION FOR SEQ ID NO: 261: 

ts (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v> FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 60 
GACTCCTGTG GAGAAGTCTG CCGTTACTGC CCTGTGGGGC AAGGTGAACG TGGATOAA3T 120 
TGGTGGTGAG GCCCTGGGCA GGTTGGTATC AAGGTTACAA GACAGGTTTA AGGAGACCAA 180 
TAGAAACTGG GCATGTGGAG ACAGAGAAG 209 

30 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQU ENCE CHARACTERISTICS: 
(A) LENGTH: 88 base pairs 
(B> TYPE: nucleic acid 

3S (C) STRANDEDNESS: single 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
^ (v) FRAGMENT TYPE: 

<vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO. 262: 

TGAGACTCTG TCTCAAAAAT AAATAAATAA ATAAATAAAT AAATAAATAA ATAAATAAAT 60 
45 AAATAAATAA GTAAAAAAGA AAGAATGC 88 

(2) INFORMATION FOR SEQ ID NO:26J: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDHA 

(iii) HYPOTHETICAL: NO 
^ V ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

GTGTGTGTGT GTGTGTGTT TTTTTTT AAC AGGGATTTGG GGAATTATTT GAGA 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
-(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 
TTCCCCAAAT CCCTGTTAAA AAC 



<2) INFORMATION FOR SEQ ID NO:265: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 : 
TTCCCCAAAT CCCTGTTAAA AAAAC 

(2) INFORMATION FOR SEQ ID NO:266: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
TTCCCCAAAT CCCTGTTAAA AAAAAAC 27 

5 

(2) INFORMATION FOR SEQ ID NO:267: 

(i) SEQUENCE CHARACTERISTICS: 
(A) length : 103 base pairs 
<B) TYPE: nucleic acid 

10 iC} STRANDEDNESS : single 

(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 

..(iv) ANT I SENSE : NO 
15 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

GTAAAACGAC CGCCAGTGCC AAGCTTGCAT GCCTGCAGGT CGACTCTAGA GGATCCCCGG 60 
2Q GTACCGAGCT CGAATTCGTA ATCATGGTCA TAGCTGTTTC CTG 103 



(2) INFORMATION FOR SEQ ID NO: 26 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 78 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
<D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGNCTG 60 
GGCATGTGGA GACAGAGA 78 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE ; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
TCTCTGTCTC CACATGCCCA GNCTC CTCAG GACTCAGGTG CACATGGTGT CTGTTTGAGQ 60 



55 



179 



EP 1 164 203 A2 



CATGGCGCAC CTGAGCTC 78 

(2) INFORMATION FOR SEQ ID NO:270: 

<i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: unknown 

10 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 
(yj.) ORIGINAL SOURCE: 

/$ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 
GCATGGCGCA CGTGACTC 78 

20 (2) INFORMATION FOR SEQ ID NO: 271 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 
GCATGGCGCA CCTGACTCCT GA 82 

(2) INFORMATION FOR SEQ ID NO: 272: 



25 



30 



35 



<i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 42 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
40 <D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT IS ENS B : NO 

(v) FRAGMENT TYPE: 

45 (vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CG 42 
50 (2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

. (C) ST HANDEDNESS : single 

5 ' ~~ ' (D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

10 <vi) ORIGINAL SOURCE: 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 2 73: 
CACCTGACTC CTA 13 
15 (2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

25 (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274: 
CACCTGACTC CTGGA 14 
30 (2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQU ENCE CHARACTERISTICS t 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
Uii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:275: 
CACCTGACTC CTGA 14 
4S (2) INFORMATION FOR SEQ ID NO: 276: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
^ <D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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10 



15 



20 



(iv) ANTI SENSE : NO 

(v) FRA(2*l£NT TYPE: 
-'■J'*-.) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

CCATGGTGTC TGTTTGAGGC ATGGCG 26 

(2) INFORMATION FOR SEQ ID NO: 277: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : Single 

- (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 
AGTCATCTTG GGGCT 75 

25 (2) INFORMATION FOR SEQ ID NO: 278 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 61 base pai*s 
<B) TYPE : nucleic acid 

(C) STRANDEDNESS: single . 
3Q (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

35 (vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 
C 61 

(2) INFORMATION FOR SEQ ID NO:279: 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE i nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 27 9 
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GUCACUACAG GUGAGCUCCA 

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNE5S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANT Z SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 
CTCAGTCCAC GTGGTACCCT GCTG 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDBDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

CATTTGCTTC TGACACAACT GTGTT CACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 
GACTCCTGAG GAGAAGTCTG CCGTT 85 



(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARA CTER ISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
ACGGGTCCCG GAGTGGTGTC GC 

(2) INFORMATION FOR SEQ ID NO:283: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 base pairs 

f *~fB) TYPE: nucleic acid 
5 '(C) STRANDEDNESS : single 

<r>) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 
10 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Ixi> SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 
15 GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO:284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown' 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
25 (iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 84: 

30 ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGATTG 60 

GTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO:285: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
^ (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

45 ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 

CTATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO:286: 

33 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

55 
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(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE; CDNA 
HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 86: 

10 ACTGCCCTGT GGGG CAAGOT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 

GCATCAAGGT TACAAG 76 

(2) INFORMATION FOR SEQ ID NO:287: 

"(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
20 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

25 

ACAG G T TTA A GGAGACCAAT AGAAACTGGG CATGTGGAGA CAGAGAAG 48 
(2) INFORMATION FOR SEQ ID NO: 288 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 base pairs 

<B> TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDHA 
35 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 88 : 

40 

GACGACGACT GCT ACCTGAC TCCA 24 
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(yi> ORIGINAL SOURCE: 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO : 28 9 : 

" O - *" 

5 

ACAGCGCACT GCTACCTGAC TCCA 24 

(2) INFORMATION FOR SEQ ID NO: 2 90: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pair© 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

15 (ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 
<v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

23 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

TGGAGTCAGG TAGCAGTC 18 

(2) INFORMATION FOR SEQ ID NO:29l: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; unknown 

* 

(ii) MOLECULE TYPE: cDNA 
30 (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 91: 

35 

CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i> SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 6S base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
45 (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 
(V) FRAGMENT TYPE: 
(vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 92: 

SO 

AGTCATCTTG GGGCTGTCGA GAGTAAAAGG TATGTCAGTC ATAGTTAAGA CCTTCTTAAA 
GGTCT 65 
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(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPB; nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

Jxi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 
GTAATTTCTA TCAGTAGAAC CCCGA 

(2) INFORMATION FOR SEQ ID N0:294: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPB: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(II) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 94: 
CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAOAT 
(2) INFORMATION FOR SEQ ID NO:295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPB: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
AGTCATCTTG GGGCT 

(2) INFORMATION FOR SEQ ID NO: 296: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE : CDMA 

(iii) HYPOTHETICAL: NO 
-h'^v? ANTISENSE: NO 

(vT FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
C AGCTCT CAT TTTCCATACA GTCAGTATCA A TTCT GGAAG AATTTCCAGA CATTAAAGAT 60 
(2) INFORMATION FOR SEQ ID NO;297: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 

-(B) TYPE: nucleic acid 
is (C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
20 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
AGTCATCTTG GGGCTA 16 
(2) INFORMATION FOR SEQ ID NO: 2 98: 



25 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
3S (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
CAGCT CTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCT 43 
(2) INFORMATION FOR SEQ ID NO: 2 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(Vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 9: 
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CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCTA 44 
. ( 2 )._ INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 



10 



15 



(ii> MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
Civ) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(yi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
CAGCTCTCAT TTTCCATACA GT 22 
(2) INFORMATION FOR SEQ ID NO: 301: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

25 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NC 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE s 

30 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 3 01: 

CAGCTCTCAT TTTCCATACA T 21 
(2) INFORMATION FOR SEQ ID NO: 302: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

40 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 : 

GCCTGGTACA CTGCCAGGCG CTTCTGCAGG TCATCGGCAT CGCGGAGGAG SO 
(2) INFORMATION FOR SEQ ID NO:303: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 
GCCTGGTACA CTGCCAGGCA CTTCTGCAGG TCATCGGCAT CGCGGAGGAG 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 
GATGCCGATG ACCTGCAGAA G 



(2) INFORMATION FOR SEQ ID NO : 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 OS: 
GATGCCGATG ACCTGCAGAA GC 

(2) INFORMATION FOR SEQ ID NO: 306; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE:. 
<vi) ORIGINAL SOURCE: 

" "\Sci) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

GATGCCGATG ACCTGCAGAA GTGC 

(2) INFORMATION FOR SEQ ID NO: 307: 

<i) SEQU ENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAOdENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 
GATQAT C CO A CG 



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : CDNA 

(iii) HYPOTHETICAL: NO 
(lv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 
CTGATGCGTC GGATCATC 

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vl) ORIGINAL SOURCE: 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 309 
GATGATCCGA CG 
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(2) I N FORMATION FOR SEQ ID NO: 310: 

(±> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

( C } STRANDEDNESS : s ingl e 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
<iii) HYPOTHETICAL: NO 
<iv) ANT I SENSE : NO 
(v) FRAGMENT TYPE: 
<vi> ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 
GGCGCGGACA TGGAGGACGT GTGCGGCCGC CTGGT 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE; NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 311 
TCCGCGATGC CGATGACCTG CAGAAGCGCC TGGC 

(2) INFORMATION FOR SEQ ID NO: 3 12: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE : nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSB: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 
CGGCTGCGAT CACCGTGCGG CACAGCT 

(2) INFORMATION FOR SEQ ID NO: 313 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

( B) TYPE: nucleic acid 

< C ) STRANDEDNESS : 8 ingl e 
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(D) TOPOLOGY: unknown 

Ui) MOLECULE TYPE: cDNA 
5 Uii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

10 

CGGCTGCOAT CACCGTGCGG T 21 
(2) INFORMATION FOR SEQ ID NO: 314: 

(A) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 27 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
20 (iii> HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(xi> SEQUENCE DESCRIPTION: SEQ 10 NO: 314: 

25 

CGGCTGCOAT CACCGTGCGG AACAGCT 2 7 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
35 (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

40 

CGGCTGCGAT CACCGTGCGG CA 22 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:316: 

CGGCTGC^.*^- CACCGTGCGG TA 22 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(B> TYPE: nucleic acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE: NO 

15 (v) FRAGMENT TYPE: 

(vi> ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:317: 
CGGCTGCGAT CACCGTGCGG A 21 

20 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base paira 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 
<iv) ANTI SENSE : NO 

30 ( V ) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:318: 
ATCATCAACT GGAAGATCAG GTCAGGAGCC ACTTGCCANC CT 42 

35 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 

40 (c) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 
45 (v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:319: 
ATCATCACAC TGGAAGACTC CAGGTCAGGA GCC 33 

SO 

(2) INFORMATION FOR SEQ ID NO:320: 
(i) SEQUENCE CHARACTERISTICS: 

55 
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(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONBSS : single 
5 ^(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

io (vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 
ATCCACTACA ACTACATGTG TAACAGTTGG wGCwwGCC 48 

15 

Claims 

1 . A process for determining the order of base specifically terminated nucleic acid fragments of a target nucleic acid 
20 molecule, comprising the steps of: 

a) obtaining a nucleic acid molecule, comprising the target nucleic acid sequence and, at one end, a tag; 

b) generating base specifically terminated nucleic acid fragments from the target nucleic acid; and 

c) analyzing the fragments by a mass spectrometry format, thereby determining the order of the base specif- 
2 * icalry terminated nucleic acid fragments in the target nucleic acid molecule. 

2. A process of claim 1 , wherein in step b), a nuclease is contacted with the target nucleic acid to generate the base 
specifically terminated nucleic acid fragments. 

so 3. A process of claim 2, wherein the nuclease is a restriction enzyme that can recognize and cleave at least one 
restriction site in the target nucleic acid. 

4. A process of claim 2, wherein the target nucleic acid is a deoxyribonucleic acid and the nuclease is a deoxyribo- 
nudease. 



35 
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5. A process of claim 2, wherein the target nucleic acid is a ribonucleic acid and the nuclease is a ribonuclease. 



6. A process of claim 5, wherein the ribonuclease is selected from the group consisting of : the G -specif ic ribonu- 
clease, the A-specific U 2 ribonuclease, the A/U specific PhyM ribonuclease, the U/C specific ribonuclease A, the 

40 C-speciflc chicken liver ribonuclease and crisavrtln. 

7. A process of claim 1 , wherein in step b), the base specifically terminated nucleic acid fragments are generated by 
performance of a combined amplification and base-specific termination reaction. 

*s 8. A process of claim 7, wherein the combined amplification and base-specific termination reaction Is performed using 
a first polymerase, which has a relatively low affinity towards at least one chain terminating nucleotide, and a 
second polymerase, which has a relatively high affinity towards at least one chain terminating nucleotide. . 



9. A process of claim 8, wherein the first and second polymerases are thermostable DNA polymerases. 



10. A process of claim 9, wherein the thermostable DNA polymerases are selected from the group consisting of: Taq 
DNA polymerase, AmpliTaq FS DNA polymerase, Deep Vent (exo-) DNA polymerase, Vent DNA polymerase, Vent 
(exo*) DNA polymerase, Vent DNA polymerase, Vent (exo-) DNA polymerase, Deep Vent DNA polymerase, Thermo 
Sequenase, exo(-) Pseudococcus furiosus (Pfu) DNA polymerase, AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, 

5$ Pyrococcus furiosus (Pfu) and Pyrococcus woesei (Pwo) DNA polymerase. 

11. A process of claim 1 , wherein the base specifically terminated nucleic acid fragments generated in step b) include 
mass modified nucleotides. 
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12. A process of claim 1 , wherein the tag comprises a 3* tag. 

13. A process of claim 1 , wherein the tag comprises a 5* tag. 

14. A process of claim 12 or 13, wherein the tag is a non-natural tag. . 

15. A process of claim 14, wherein the non-natural tag is selected from the group consisting of: an affinity tag and a 
mass marker 

16. A process of claim 15, wherein the affinity tag facilitates immobilization of the nucleic acid to a solid support. 

17. A process of claim 16, wherein the affinity tag is biotin or a nucleic acid sequence that (s capable of binding to a 
capture nucleic acid sequence that is bound to a solid support. 

18. A method for detecting neoplasla/malagnancles In a tissue or ceil sample, comprising detecting telomerase activity, 
mutation of a proto-oncogene, or expression of a tumor specific gene in the sample by detecting nucleic acids that 
encode the telomerase, that are specific for the mutation or that encode the tumor-specific by mass spectrometry. 

19. The method of claim 18 that is a method for detecting a neoplasia/malignancy in a tissue or cell sample, comprising: 

a) isolating telomerase from the sample and adding a synthetic DNA primer, which is optionally immobilized, 
complementary to a telomeric repeat, and all four deoxynucleotide triphosphates under conditions that result 
in telomerase specific extension of the synthetic DNA; 

b) amplifying the telomerase extended DNA product; and 

c) detecting the DNA product by mass spectrometry, wherein telomerase-speclflc extension is indicative of 
neoplasia/malignancy. 

20. The method of claim 1 9, wherein the primer contains a linker moiety for immobilization on a support; and the 
amplified primers are isolated by conjugating the linker portion to a solid support. 

21 . A process for detecting telomerase activity in a biological sample, comprising the steps of: 

a) incubating the biological sample; a substrate primer, which can be extended by telomerase activity; and a 
complete set of deoxy nucleoside triphosphates; and 

b) detecting a telomerase extended substrate primer by mass spectrometry, thereby detecting telomerase 
activity in the biological sample. 

22. The process of claim 21, wherein the substrate primer is immobilized to a solid support. 

23. The process of claim 22, wherein the substrate primer is immobilized In an array on the solid support. 

24. The process of claim 21 , further comprising amplifying the telomerase extended substrate primer prior to mass 
spectrometry. 

25. The method of claim 18 that is a method for identifying cells or tissues transformed by a mutant proto-oncogene, 
comprising: 

a) in a cell or tissue sample, amplifying a portion of a proto-oncogene that includes a codon indicative of 
transformation, wherein one primer comprises a linker moiety for Immobilization; 

b) Immobilizing DNA via the linker moiety to a solid support, optionally In the form of an array; 

c) hybridizing a primer complementary to the proto-oncogene sequence that is upstream from the codon 

d) adding 3dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP 
location; 

e) ionizinp/volatizing the sample; and 

f) detecting the mass of an extended DNA indicative of a mutant proto-oncogene, thereby identifying cells or 
tissues transformed by a mutant proto-oncogene. 

26. The method of claim 25, wherein the proto-oncogene is the RET proto-oncogene. 
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27. The method of claim 26, wherein the codon indicative of transformation is codon 634 of the RET proto -oncogene. 

28. The method of claim 18 that is a method for detecting expression of a tumor-specific gene, comprising: 

s a) isolating polyA RNA from the sampte; 

b) preparing a cDNA library using reverse transcription; 

c) amplifying a cDNA product, or portion thereof, of the tumor-specific gene with a set of primers, wherein one 
primer of the set of primers comprises a linker moiety; 

d) isolating the amplified product by immobilizing the DNA to a solid support via the linker moiety; 
"> e) optionally conditioning the DNA; and 

f) ionizing/volatizing sample and detecting the presence of a DNA peak that is indicative of expression of the 
gene. 

29. The method of claim 28, wherein the cells are bone marrow ceils, the gene is the tyrosine hydroxylase gene, and 
*5 expression of the gene is indicative of neuroblastoma. 

30. A method for directly detecting a double-stranded nucleic acid using matrix-assisted laser desorptlon/ionizatlon 
time-of-f light (MALDl-TOF) mass spectrometry, comprising: 

20 a) isolating a double-stranded DNA fragment from a cell or tissue sample; 

b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA, 
wherein the conditions include one or both of the following: preparing samples for analysis at a temperature 
of about 4*C or less, and using a high concentration of DNA in the matrix to drive duplex formation; 

c) ionizing/volatizing the DNA of step b), using a low acceleration voltage; and 

25 d) detecting the presence of the double -stranded DNA by MALDl-TOF mass spectrometry. 

31. A method for comparing DNA samples to discern relatedness or to detect mutations, comprising: 

a) obtaining a plurality of biological samples; 
30 b) amplifying a region of DNA from each sample that contains two or more microsatellite DNA repeat sequenc- 

es; 

c) detecting the presence of the amplified DNA from each sample by mass spectrometry and comparing the 
molecular weights of the amplified DNA, wherein different sizes are indicative of non-identity or mutations 
between or among the samples. 

33 

32. The method of claim 31 , wherein non-identity is indicative of the presence of a mutation in the DNA In one sample, 
of non-relatedness, or of non-HLA compatibility between or among the individuals from whom the samples were 
obtained. 



40 33. The method of daim 31 or 32, wherein a plurality of markers are examined simultaneously. 
34. A method for identifying a target nucleotide in a nucleic acid sequence, comprising: 

a) amplifying at least a portion of the nucleic acid sequence comprising the target nucleotide using; 
(i) a first primer, wherein: 



45 



the 5'-end of the primer shares Identity to a portion of the nucleic acid sequence immediately down- 
stream from the target nucleotide followed by a sequence encoding a unique restriction endonuclease 
so site, and 

the 3*-end of the primer is self-complementary; and 

(ii) a second downstream primer that contains a tinker moiety, 

thereby producing an amplified double stranded nucleic acid comprising at least a portion of the 
55 nucleic acid sequence comprising the target nucleotide; 

b) immobilizing the amplified double-stranded nucleic acid to a solid support via the linker moiety; 

c) denaturing the immobilized nucleic acid and isolating the non- immobilized strand; 
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d) annealing the intracomplementary sequences in the 3'-end of the isolated non-immobilized strand, such 
that the 3'- end is extendable by a polymerase, thereby producing a self-annealed nucleic acid; 
f) extending the self -annealed nucieic acid by incubation with a polymerase, 3 dNTPs and 1 ddNTP, which 
corresponds to the missing dNTP, thereby producing an extended nucleic acid; 
s g) cleaving the extended nucleic acid with a restriction endo nuclease specific for the unique restriction endo- 

nuclease site; and 
h) identifying the target nucleotide. 

35. The method of claim 34, wherein identifying the target nucleotide is indicative of a mutation in the nucieic acid 
10 sequence. 

36. The method of claim 34, wherein the target nucleotide is identified based on the mass of the extended nucleic acid. 

37. The method of claim 35, wherein the mass of the extended nucleic acid is determined by mass spectrometry. 

is 

38. A method for detecting a target nucleic acid in a biological sample using RNA amplification, comprising: 

a) amplifying a target nucleic acid using a primer comprising a sequence that is complementary to the target 
sequence, or a complement thereof, and a sequence that encodes an RNA polymerase promoter; 
20 b) synthesizing RNA using an RNA polymerase that recognizes the promoter, and 

c) detecting the resulting RNA using mass spectrometry, thereby detecting the presence of the target nucleic 
acid sequence in the biological sample. 

39. A process of detecting the presence of a target nucleic acid sequence, comprising the steps of: 

25 

a) incubating in a reaction mixture 

i) an RNA polymerase, 

ii) nucleoside triphosphates, and 

30 Hi) a nucleic acid molecule comprising the target nucleic acid sequence, or a complement thereof, and a 

promoter for the RNA polymerase, 

thereby producing an RNA molecule comprising the target nucleic acid sequence, or a complement 
thereof; and 

35 b) detecting the RNA molecule by mass spectrometry, thereby detecting the presence of the target nucieic 

acid sequence. 

40. The process of claim 39, wherein the nucleic acid molecule comprising the target nucleic acid sequence is DNA 
and the RNA polymerase is a DNA dependent RNA polymerase. 

40 

41. The process of claim 40, further comprising, following step a) and prior to step b), 

inactivating the RNA polymerase; and 
digesting the DNA using RNAse-free DNAse I. 

45 42. The process of claim 39, further comprising: 

hybridizing a detector oligonucleotide to the RNA molecule comprising the target nucleic acid sequence, 
wherein the detector oligonucleotide is complementary to a portion of the target nucleic acid sequence; and 
removing un hybridized detector oligonucleotide; 

50 

wherein, in step b), the RNA molecule comprising the target nucleic acid sequence is detected by detecting 
hybridized detector oligonucleotide. 

43. A method for detecting the presence of a target nucleic acid sequence, comprising: 

55 

a) amplifying the target nucieic acid sequence using a primer comprising a sequence that is complementary 
to at least a portion of the target sequence, or a complement thereof, and a sequence that encodes an RNA 
polymerase promoter, thereby producing an amplified nucleic acid molecule comprising the target nucleic acid, 
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or complement thereof, and an RNA polymerase promoter; 

b) incubating the amplified nucleic acid molecule with nucleoside triphosphates and an RNA polymerase that 
recognizes the promoter, thereby producing an RNA corresponding to the target nucleic acid sequence; and 

c) detecting the RNA using mass spectrometry, thereby detecting the presence of the target nucleic acid se- 
5 quence. 

44. A process for detecting a target nucleic acid sequence present in a biological sample, comprising the steps of: 

a) obtaining a nucleic acid molecule containing a target nucleic acid sequence from a biological sample; 
10 b) immobilizing the target nucleic acid sequence on a solid support viathiol linkages, whereby the target nucleic 

acid sequence is present at a sufficient density to detect it using mass spectrometry; 

c) hybridizing a detector oligonucleotide with the target nucleic acid sequence; 

d) removing unhybridized detector oligonucleotide; 

e) ionizing and volatizing the product of step c); and 

is f) detecting the detector oligonucleotide by mass spectrometry, 

wherein detection of the detector oligonucleotide indicates the presence of the target nucleic acid sequence in the 
biological sample. 

20 45. The process of claim 44, wherein the target nucleic acid sequence is amplified prior to immobilization. 

46. The process of claim 44 or 45, wherein at least one of the detector oligonucleotide or the target nucleic acid 
sequence has been conditioned. 

25 47. A process of any of claims 44-46, wherein the solid support is selected from the group consisting of: beads, flat 
surfaces, pins and combs, 

48. A process of any of claims 44-47, wherein the target nucleic acid is immobilized in the form of an array. 
30 49. a process of any of claims 44-48, wherein the support is a silicon wafer 

50. A process of any of claims 44-49, wherein the target nucleic acid sequence is amplified by an amplification pro- 
cedure selected from the group consisting of cloning, transcription, the polymerase chain reaction, the ligase chain 
reaction, and strand displacement amplification. 

35 

51. A process of any of claims 44-50, wherein the mass spectrometer is selected from the group consisting of: Matrix- 
Assisted Laser Desorption/lonization Time-of-Flight, Electrospray, Ion Cyclotron Resonance, and Fourier Trans- 
form. 

40 52. A process of any of claims 44-51 , wherein the sample is conditioned by mass differentiating at least two detector 
oligonucleotides or oligonucleotide mimetics to detect and distinguish at least two target nucleic acid sequences 
simultaneously. 

53. A process of claim 52, wherein the mass differentiation is achieved by differences in the length or sequence of the 
J* at least two oligonucleotides. 

54. A process of claim 53, wherein the mass differentiation is achieved by the introduction of mass modifying func- 
tionalities in the base, sugar or phosphate moiety of the detector oligonucleotides. 

50 55. a process of claim 52, wherein the mass differentiation is achieved by exchange of cations at the phosphodi ester 
bond. 

56. A process of any of claims 44-55, wherein the nucleic acid molecule obtained from a biological sample is amplified 
into DNA using mass modified dideoxynucleoslde triphosphates and DNA dependent DNA polymerase prior to 

ss mass spectra metric detection. 

57. A process of any of claims 44-56, wherein the nucleic acid molecule obtained from a biological sample is amplified 
into RNA using mass modified ribonucleoside triphosphates and DNA dependent RNA polymerase prior to mass 
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spectrometry detection. 

58. A process of any of claims 44-57, wherein the target nucleic acid sequence is indicative of a disease or condition 
selected from the group consisting of a genetic disease, a chromosomal abnormality, a genetic predisposition, a 

s viral infection, a fungal infection and a bacterial infection. 

59. A process of any of claims 39-58, wherein the detector oligonucleotide is a peptide nucleic acid. 

60. A primer, comprising at least about 20, preferably about 1 6, bases of any of the sequence of nucleotides sequences 
10 set forth in SEQ ID NOs. 32-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128 and 129. 

61 . A primer, comprising at least about 20, preferably about 1 6, bases of any of the sequence of nucleotides sequences 
set forth in SEQ ID NOs. 1-22, 24, and 27-32. 

is 62. A primer of claim 60 or 61 that is unlabeled, and optionally includes a mass modifying moiety, which is preferably 
attached to the 5'end. 

63. The method of any of claims 1-17 and the process of any one of claims 1 8-59, wherein a nucleic acid is immobilized 
to a solid support via a selectively cleavabie linker. 



20 
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64. The method or the process of claim 63, wherein the linker is thermocleavable, enzymatically cleavabie, photo- 
cleavable or chemically cleavabie. 

65. The method or the process of claim 63, wherein the linker is a trityl linker. 

66. The method or the process of claim 65, wherein the linker is selected from the group consisting of 1-(2-nitro-5- 
(3-0-4,4 , -dimethoxytritylpropoxy) phenyl)- 1 -0-((2-cyanoethoxy)-diisopropylaminophosphino)ethane and 1 -(4- 
(3-0-4,4'-dimetho)rytrltylpropoxy)-^ 

ethane. 

67. The method or process of any of claims 1-59 and 63-66 wherein a primer is a peptide nucleic acid. 

68. The primer of claims 60-62 that is a peptide nucleic acid. 
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SEQUENCE OF THE AMPLIFIED 209 bp PCR-PRODUCT OF "ME /J-GLOBN 



FORWARD PRIMER: 02 
CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA 

t2mer PRIMER 

TR dTBfcAftflT i&ACTCfc TGTO GAGAAGTCTG CCGTTACTGC CCTGTGGGGC 

MGGTCAACG TGGATGAAST TCGTCGTGAG GXCTGGGCA GGTTGGTATC 
AAGCTTACAA GACAGGTTTA AGGAGACCAA TAGAAACTGG GCATCTGGAG 

ACAGAGAAG 
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